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METHODS ]\ND MEANS FOR MODIFICATION 
OF PLANT CHARACTERISTICS 

The present invention relates generally to modification of 
5 characteristics of plants, especially vernalization response, 
flowering time, leaf shape and/or shade avoidance response, 
and has arisen on the basis of cloning of the VRN2 gene and 
mutant alleles of Arabidopsis thaliana, and identification of 
homologues in other species. 

10 

In different embodiments, the present invention provides for 
manipulation of flowering time and/or other characteristics of 
plants, e.g. by up or down regulating VRN2 gene expression. 
The present invention also provides for modification of the 
15 extent of alteration of a relevant plant characteristic 
through the use of gene alleles, mutants and variants. 

Plants must integrate a wide variety of environmental signals 
in order to maximize their reproductive success. Part of this 

20 integration must involve perception of the seasons, both to 

ensure the plant flowers during the correct season (for which 
it is adapted) and to synchronise its flowering with other 
members of its own species, to increase the chances of cross- 
fertilization. Arabidopsis thaliana serves as a model plant, 

25 for it exhibits responses to a wide variety of environmental 
stimuli that are observed in many species. Flowering in 
naturally occurring strains (ecotypes) of Arabidopsis can be 
promoted either by long days (increased photoperiod) or by 
vernalization, a long cold treatment that mimics the cold of 



winter. While many aspects of the photoperiodic response are 
now understood, the vernalization pathway has received 
relatively less attention. The inventors have used a late 
flowering, vernalization responsive mutant of Arabidopsis, the 
fca mutant, as a background in which to isolate mutants that 
exhibit a reduced vernalization response, the VRN mutants. 

Vernalization is the low temperature promotion of flowering. 
It can also be thought of as the cold aspect of the winter 
season, which also includes reduced daylight hours. Many 
species of plants that grow in temperate or cooler climes have 
an obligate requirement for vernalization in order to flower. 
Such plants typically germinate in autumn, and over winter as 
vegetative plants, and flower in milder conditions of spring. 
Vernalization thus acts as a cue, to allow plants to sense the 
seasons, and to time their flowering to maximise their chance 
of reproductive success. 

Species ror which flowering is important to crop production 
are numerous, essentially all crops which are grown from seed, 
with important examples being the cereals, rice and maize, 
probably the most agronomical ly important in warmer climatic 
zones, and wheat, barley, oats and rye in more temperate 
climates. Important seed products are oil seed rape, sugar 
beet, maize, sunflower, soybean and sorghum. Many crops which 
are harvested for their roots are, of course, grown annually 
from seed and the production of seed of any kind is very 
dependent upon the ability of the plant to flower, to be 
pollinated and to set seed. In horticulture, control of the 



timing of flowering is important. Horticultural plants whose 
flowering may be controlled include lettuce, endive and 
vegetable brassicas including cabbage, broccoli and 
cauliflower, and carnations and geraniums. 

Arabidopsis thaliana is a facultative long day plant, 
flowering early under long days and late under short days. 
Because it has a small, well -characterized genome, is 
relatively easily transformed and regenerated and has a rapid 
growing cycle, Arabidopsis is an ideal model plant in which t- 
study flowering and its control. 

In addition to cloning the VRN2 gene, the inventors have 
unexpectedly found indication that VRN2 is a transcription 
factor, which in itself opens several exciting avenues for 
application of the present invention. Without being bound by 
theory or in anyway limiting the scope of the present 
invention, VRN2 may be required for a normal vernalization 
response because it acts as a regulator of genes that 
ultimately lead to the transition from vegetative to 
reproductive growth. In such a model, cold, or a downstream 
molecule involved in cold perception, may regulate the 
activity or expression of the VRN2 protein, which in turn may 
regulate the expression of a large number of genes that 
ultimately lead to flowering. Furthermore, the shade 
avoidance phenotype exhibited by the V2rn2-1 mutant., as 
demonstrated experimentally below, provides indication that 
VRN2 also plays a role in regulating leaf shape, particularly 
in response to increased far-red light. Together, these two 



4 

processes affected by a deficiency or reduction of VRN2 
activity provide for a number of approaches of agronomic 



interest , 



5 First, forced expression of VRN2 (for example under the 

control of a strong and constitutive promoter, such as the Ca 
MV 35 S promoter) in a wild-type background may be used alter 
the vernalization requirement of a plant prior to flowering. 
As a large number of commercial cultivars of several species, 
10 including (diploid) wheat, barley, and sugar beet, have a 

requirement for vernalization to flower, modification of this 
requirement, by reducing the duration of vernalization 
required, or changing the optimum temperature, or abrogating 
the requirement altogether, is of agronomic usefulness. For 
L5 instance, a winter crop that can be sown and left in the 
ground for a shorter period than usual (i.e. a reduced 
vernalization time) may benefit from reduced risk of damage 
associated with severe winter weather conditions, as the crops 
are exposed to winter conditions for a shorter time. 



20 



Second, down-regulation of VRN2 expression, for instance by 
means of an antisense VRN2 cDNA, may be used to recapitulate 
the reduction in shade avoidance phenotype observed in vrn2-l 
mutants. This may be used in situations where crowding of the 
25 crop is a problem. Based on experimental evidence provided 
herein on the phenotype of vrn2-I mutants, such plants are 
expected to exhibit less of a response to such conditions, and 
to produce leaves that are essentially normal i.e. as if they 
had not been grown in dense or crowded conditions. 




Third, the individual isolated domains of the VRN2 protein may 
be used in their own right . DNA binding of the zinc finger of 
VRN2 may be used to direct or control gene expression in a 
precise manner. The VRN2 zinc finger may recognize specific 
5 DNA sequences that represent elements in the promoters of its 
normal target genes. By creating fusion proteins, comprising 
the DNA binding (zinc finger) domain of VRN2 , and an 
activation or repression domain from a heterologous protein, 
the expression of VRN2 target genes may be controlled. This 

10 allows for a precise control of the expression of those genes 
that are normally targets of VRN2 . Given that such genes, 
alone or in combination, ultimately control the transition to 
flowering (usually following vernalization) , their directed 
expression in other conditions may also be used to elicit 

15 changes in flowering and/or one or more other plant 

characteristics. The response of a plant to far-red light may 
be controlled using VRN2 fusion proteins to control the 
expression of (far-red responsive) target genes. Furthermore, 
the use of the zinc finger domain of VRN2 in conventional 

2 0 SELEX or one -hybrid experiments may be used to reveal the 
target genes or DNA sequences normally bound by VRN2 . 

The acidic activation domain of VRN2 may be used to regulate 
the activity of a fusion protein, including a DNA-binding 
25 protein of known specificity, and the activation domain of 

VRN2 . This allows for regulation of target genes of other DNA 
binding proteins involved in flowering, or of target genes in 
completely unrelated processes. 




The inventors have cloned, characterised and manipulated the 
VRN2 gene of Arabidopsis thaliana, both Columbia and Landsberg 
erecta types, and identified alternatively spliced and mutant 
forms, also homologues in other species. 

5 

In the light of the inventors' experimental work, a first 
aspect of the present invention provides a nucleic acid 
isolate encoding a polypeptide including a VRN2 amino acid 
sequence shown herein (e.g. SEQ ID NO: 2; SEQ ID NO: 5), which 
10 may include a coding sequence shown herein (e.g. SEQ ID NO: 1; 
SEQ ID NO: 4) . 

Allelic forms and alternatively spliced forms of the gene have 
be en identified. Such polypeptides and encoding nucleic acid 
15 (e.g. as in SEQ ID NO: 8, encoded by SEQ ID NO: 7) are each 
further provided as an aspect of the invention, as are 
polypeptids and nucleic acid including the mutations 
identified herein. 

2 0 Nucleic acid according to the present invention may have the 

sequence of the VRN2 gene of Arabidopsis thaliana as indicated 
in SEQ ID NO: 1, SEQ ID NO: 3 {Landsberg erecta genomic 
sequence), SEQ ID NO: 4 or SEQ ID NO: 6 {Columbia genomic 
sequence), or be a mutant, variant, derivative or allele or a 

25 homologue of the sequence provided. Preferred mutants, 

variants, derivatives and alleles are those which encode a 
protein which retains a functional characteristic of the 
protein encoded by the wild-type gene, especially the ability 
to alter vernalization response, flowering time, leaf shape 
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and/or shade avoidance response. 

A mutant, variant, derivative, allele or homologue in 
accordance with the present invention may have the ability to 
5 affect a physical characteristic of a plant, especially 
vernalization response, flowering time, leaf shape and/or 
shade avoidance response, as discussed. 

Polynucleotides which are not 100% identical to the sequences 
10 shown herein but fall within the scope of the invention can be 
obtained in a number of ways . 

Other VRN2 variants (for example allelic forms) of the gene 
described herein may be obtained for example by probing cDNA 
15 or genomic DNA libraries made from Arahidopsis tha.liana. plants 
or cells. 

In addition, other plant, monocot or dicct, homologues of the 
gene may be obtained. Such sequences may be obtained by 

2 0 making or obtaining cDNA libraries made from dividing cells or 
tissues or genomic DNA libraries from other plant species, and 
probing such libraries with probes comprising all or part of a 
nucleic acid of the invention under conditions of medium to 
high stringency (for example for hybridization on a solid 

25 support (filter) overnight incubation at 42°C in a solution 
containing 50% formamide, 5xSSC (750 mM NaCl , 75 mM sodium 
citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt ' s 
solution, 10% dextran sulphate and 20 /ag/ml salmon sperm DNA, 
followed by washing in 0 . 03M sodium chloride and 0.03M sodium 



citrate (i.e. 0 . 2x SSC) at from about 50°C to about 60°C) . 

Thus the present invention provides an isolated nucleic acid 
which hybridizes to the nucleotide sequence shown in a figure 
5 herein under the abovementioned hybridization and washing 
conditions. Such a nucleic acid is suitable for use as a 
probe for detecting the VRN2 gene, for example in Southern 



blots . 



10 Suitable probe and primer sequences are disclosed herei 



m . 



Alternatively, polynucleotides of the invention may be 
obtained by site directed mutagenesis of the sequences of 
shown in the figures or allelic variants thereof. This may be 
15 useful where for example silent codon changes are required to 
sequences to optimise codon preferences for a particular host 
cell in which the polynucleotide sequences are being 
expressed. Other sequence changes may be desired in order to 
mcroduce restriction enzyme recognition sites, or to alter 
20 the property or function of the polypeptides encoded by the 
polynucleotides. Further changes may be desirable to 
represent particular coding changes which are required to 
provide, for example, conservative substitutions. 

25 In the context of cloning, it may be necessary for one or more 
gene fragments to be ligated to generate a full-length coding 
sequence. Also, where a full-length encoding nucleic acid 
molecule has not been obtained, a smaller molecule 
representing part of the full molecule, may be used to obtain 
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full-length clones. Inserts may be prepared from partial cDNA 
clones and used to screen cDNA libraries. The full-length 
clones isolated may be subcloned into expression vectors and 
activity assayed by transfection into suitable host cells, 
5 e.g. with a reporter plasmid. 

The present invention also extends to nucleic acid comprising 
transcriptional control sequences for the VRN2 gene. Such 
control sequences will be found 5' to the open reading frame 

10 of the gene and are obtainable by probing a genomic DNA 

library with a nucleic acid of the invention, selecting a 
clone which hybridizes under conditions of medium to high 
stringency, and sequencing the clone 5' to the open reading 
frame of the gene. Where only a small amount of sequence is 

15 present in the 5' region, this sequence may be used to reprobe 
the library to genome walk further upstream. Analysis of the 
upstream region will reveal control regions for gene 
expression including control regions common to many genes (i.e 
TATA and CAAT boxes) and other control regions, usually 

20 located from 1 to 10,000, such as 1 to 1000 or 50 to 500 
nucleotides upstream of the start of transcription. 

To confirm that such regions are the control regions of the 
gene, they may be linked to a reported gene (such as (3- 
25 galactosidase) and tested in any suitable in vitro or in vivo 
system. For example the construct of the control region (e.g. 
comprising 50 to 500 nucleotides upstream of the start of 
transcription) and the reporter gene may be used to produce a 
transgenic plant and the pattern of expression, both spatially 



10 

and developmentally, may be compared with that of the VRN2 
gene. Where substantially similar patterns of expression are 
found, this shows that the construct comprises substantially 
all of the control region of the wild type gene. 

SEQ ID NO: 3 and SEQ ID NO: 6 show the nucleotide sequence of 
the VRN2 genomic region including promoter, respectively for 
Landsberg areata and Columbia ecotypes of Arabidopsis 
thaliana, also 3' regulatory elements. 



10 



The control region may be mutated to identify specific 
subregions responsible for transcriptional control. This may 
be achieved by a number of ' techniques well known in the art as 
such, including DNase protection footprint assays, in which 
15 the control region is brought into contact with an extract 

from a cell in which the VRN2 gene is actively expressed, and 
the regions of the control region which bind factors in that 
extract is determined. 

20 Isolated nucleic acid comprising such control regions 

obtainable by such a method form a further aspect of the 
present invention. 



The present invention further extends to genomic DNA exon 
25 sequences found between the introns encoding a VRN2 gene in 
plant. Such exon sequences may be obtained in a manner 
analogous to that described above for the transcriptional 
control sequences, with the appropriate genome walking being 
conducted between the intron sequences. The locations of the 
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exons may be determined by comparing genomic and cDNA 
sequences of the gene, observing where the sequences line up 
and diverge, and looking for consensus splice sequences which 
define intron/exon boundaries. 

5 

As noted above, changes to a sequence, to produce a mutant, 
^ variant or derivative, may be by one or more of addition, 

insertion, deletion or substitution of one or more nucleotides 
in the nucleic acid, leading to the addition, insertion, 
10 deletion or substitution of one or more amino acids in the 

encoded polypeptide. Of course, changes to the nucleic acid 
which make no difference to the encoded amino acid sequence 
( "degeneratively equivalent") are included. 

15 Preferred nucleic acid sequences according to the present 

invention are shown herein, for instance see SEQ ID NO: 1 and 
SEQ ID NO: 4, of which the respective predicted encoded amino 

acid sequences of pol\^eptides according to the present 
invention are shown in SEQ ID NO: 2 and SEQ ID NO: 5. 

20 

A mutant, allele, variant or derivative amino acid sequence in 
accordance with the present invention may include within a 
sequence shown herein a single amino acid change with respect 
to the sequence shown with the relevant SEQ ID NO: or in the 
25 relevant figure, or 2 , 3, 4,-5, 6, 7, 8, or 9 changes, about 
10, 15, 20, 30, 40 or 50 changes, or greater than about 50, 
60, 70, 80 or 90 changes. In addition to one or more changes 
within the amino acid sequence shown in the relevant figure, a 
mutant, allele, variant or derivative amino acid sequence may 
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include additional am.no acids at the C-terminus and/or N- 
terminus . 



A sequence related to a sequence specifically disclosed herein 
5 shares homology with that sequence. Homology may be at the 
nucleotide sequence and/or amino acid sequence level. 
Preferably, the nucleic acid and/or amino acid sequence share 
homology with the coding sequence or the sequence encoded by 



s 
a 

nucleotide sequence shown herein, for instance SEQ ID 



on 



NO: 2 or 

10 SEQ ID NO: 5, preferably at least about 50%, or 60%, or 70%, 
or 80% homology, most preferably at least about 90%, 95%, 96%, 
97%, 98% or 99% homology. 



As is well-understood, homology at the amino acid level is 
15 generally in terms of amino acid similarity or identity. 
Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoleucine, 
valine, leucine or methionine for another, or the substituti 
of one polar resxdue for another, such as arginine for lysine 
20 glutamic for aspartic acid, or glutamine for asparagine. 
Similarity may be as defined and determined by the TBLASTN 
program, of Altschul et al . (1990) J. Mol . Biol. 215: 403-10, 
which is in standard use in the art, or, and this may be 
preferred, either of the standard programs BestFit and GAP, 
25 Which are part of the Wisconsin Package, Version 8, September 
1994, (Genetics Computer Group, 575 Science Drive, Madison, 
Wisconsin, USA, Wisconsin 53711) . BestFit makes an optimal 
alignment of the best segment of similarity between two 
sequences. Optimal alignments are found by inserting gaps to 




maximize the number of matches using the local homology 
algorithm of Smith and Waterman (Advances in Applied 
Mathematics (1981) 2, pp. 482-489). GAP uses the Needleman and 
Wunsch algorithm to align two complete sequences that 
5 maximizes the number of matches and minimizes the number of 
gaps. Generally, the default parameters are used, with a gap 
creation penalty = 12 and gap extension penalty = 4 . 



Homology is generally over the full-length of the relevant 
10 sequence shown herein, that is unless stated otherwise, or it 
may be over a contiguous sequence of about or greater than 
about 20, 25, 30, 33, 40, 50, 67, 133, 167, 200, 233, 267, 
300, 333, 400, 450, 500, 550, 600 or more amino acids or 
codons, compared with the relevant amino acid sequence or 
15 nucleotide sequence as the case may be. 



Nucleic acid according to the present invention may consist 
essentially of or consist of the relevant coding sequence. 
Nucleic acid according to the present invention may include a 
20 promoter or other regulatory sequence as discussed further 
elsewhere herein, and such regulatory sequence may be 
heterologous to the coding sequence, that is to say not 
naturally operably linked therewith. Nucleic acid according 
to the present invention may be cDNA or lacking one or more 
25 introns which occur naturally, or may be in any non-naturally 
occurring form. A coding sequence in accordance with the 
present invention may be included with a larger nucleic acid 
molecule of less than about 10,000 nucleotides, less than 
about 5,000 nucleotides or less than about 2,000 nucleotides. 
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Also provided by an aspect of the present invention is nucleic 
acid including or consisting essentially of a sequence of 
nucleotides complementary to a nucleotide sequence 
hybrxdisable w.th any encoding sequence provided herein. 
5 Another way of looking at this would be for nucleic acid 

according to this aspect to be hybridisable with a nucleotide 
sequence complementary to any encoding sequence provided 
herein. Of course, DNA is generally double -stranded and 
blotting techniques such as Southern hybridisation are often 
10 performed following separation of the strands without a 
distinction being drawn between which of the strands is 
hybridising. Preferably the hybridisable nucleic acid or its 
complement encode a product able to influence a physical 
characteristic of a plant, particularly vernalization 
15 response, flowering time, leaf shape and/or shade avoidance 
response, e.g. in Arabidopsis thaliana. Preferred conditions 
for hybridisation are familiar to those skilled in the art, 
but are generally stringent enough for there to be positive 
hybridisation between the sequences of interest to the 
20 exclusion of other sequences. 

The nucleic acid, which may contain for example DNA encoding a 
polypeptide including the amino acid sequence of VRN2 or other 
polypeptide disclosed herein, as genomic or cDNA, may be in 
25 the form of a recombinant and preferably replicable vector, 
for example a plasmid, cosmid, phage or Agrotacteriun, binary 
vector. The nucleic acid may be under the control of an 
appropriate promoter or other regulatory elements for 
expression in a host cell such as a microbial, e.g. bacterial. 




or plant cell. In the case of genomic DNA, this may contain 
its own promoter or other regulatory elements and in the case 
of cDNA this may be under the control of an appropriate 
promoter or other regulatory elements for expression in the 
5 host cell , 

A vector including nucleic acid according to the present 
invent ion need not include a promoter or other regulatory 
sequence , particularly i f the vector is to be used to 
0 introduce the nucleic acid into cells for reconabinat ion into 
the genome . 

Those skilled in the art are well able to construct vectors 
and design protocol s for recombinant gene expression . 

5 Suitable vectors can be chosen or constructed, containing 
appropriate regulatory sequences , including promoter 
sequences , terminator fragments , polyadenylat ion sequences , 
enhancer sequences, marker genes and other sequences as 
appropriate . For further details see , for example , Molecular 

0 Cloning : a Laboratory Manual: 2nd edition, Sambrook et al, 
1989, Cold Spring Harbor Laboratory Press . Many known 
techniques and protocols for manipulation of nucleic acid, for 
example in preparation of nucleic acid constructs, 
mutagenesis, sequencing, introduction of DNA into cells and 

5 gene expression, and analysis of proteins, are described in 
detail in Cur-^^ent Protocols in Molecular Biology, Second 
Edition, Ausubel et al . eds . , John Wiley & Sons, 1992. The 
disclosures of Sambrook et al . and Ausubel et al . are 
incorporated herein by reference. Specific procedures and 



- % 
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vectors previously used with wide success upon plants are 
described by Bevan (Nucl . Acids Res. 12, 8711-8721 (1984)) and 
Guerineau and Mull ineaux (1993) (Plant transformation and 
expression vectors, in: Plant Molecular Biology Labfax (Croy 
5 RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148). 

Selectable genetic markers may be used consisting of chimaeric 
genes that confer selectable phenotypes such as resistance to 
antibiotics such as kanamycin, hygromycin, phosphinotricin, 
10 chlorsulfuron, methotrexate, gentamycin, spectinomycin, 
imidazolinones and glyphosate. 

Nucleic acid molecules and vectors according to the present 
invention may be provided isolated and/or purified from their 
15 natural environment, in substantially pure or homogeneous 

form, or free or substantially free of nucleic acid or genes 
of the species of interest or origin other than the sequence 
encoding a polypeptide with the required function. Nucleic 
acid according to the present invention may include cDNA, 
20 RNA, genomic DNA and may be wholly or partially synthetic. 

The term "isolate" encompasses all these possibilities. where 
a DNA sequence is specified, e.g. with reference to a figure, 
unless context requires otherwise the RNA equivalent, with U 
substituted for T where it occurs, is encompassed. 
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When introducing a chosen gene construct into a cell, certain 
considerations must be taken into account, well known to those 
skilled in the art. The nucleic acid to be inserted should be 
assembled within a construct which contains effective 



regulatory elements which will drive transcription. There must 
be available a method of transporting the construct into the 
cell. Once the construct is within the cell membrane, 
integration into the endogenous chromosomal material either 
will or will not occur. Finally, as far as plants are 
concerned the target cell type must be such that cells can be 
regenerated into whole plants . 

Plants transformed with the DNA segment containing the 
sequence may be produced by standard techniques which are 
already known for the genetic manipulation of plants. DNA can 
be transformed into plant cells using any suitable technology, 
such as a disarmed Ti-plasmid vector carried by Agrobacterium 
exploiting its natural gene transfer ability (EP-A-270355 , EP- 
A-0116718, NAR 12(22) 8711 - 87215 1984), particle or 
microprojectile bombardment (US 5100792, EP-A-444882, EP-A- 
434616) microinjection (WO 92/09696, WO 94/00583, EP 331083, 
EP 175966, Green et aJ . (1987) Plant Tissue and Cell Culture, 
Academic Press), electroporat ion (EP 290395, WO 8706614) other 
forms of direct DNA uptake (DE 4005152, WO 9012096, US 
4684611), liposome mediated DNA uptake (e.g. Freeman et al . 
Plant Cell Physiol, 29: 1353 (1984)), or the vortexing method 
(e.g. Kindle, PNAS U,S.A, 87: 1228 (1990d) Physical methods 
for the transformation of plant cells are reviewed in Oard, 
1991, Biotech. Ad\r , 9: 1-11. 

Agrobacterium transformation is widely used by those skilled 
in the art to transform dicotyledonous species. There are 
various approaches used for the routine production of stable. 
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fertile transgenic plants in almost all economically relevant 
monocot plants (Toriyama, et al. (1988) Bio/Technology 6, 
1072-1074; Zhang, et al . (1988) Plant Cell Rep, 7, 379-384; 
Zhang, et al . (1988) Theor Appl Genet 76, 835-840; Shimamoto, 
5 et al. (1989) Nature 338, 274-276; Datta, et al , (1990) 
Bio/Technology 8, 736-740; Christou, et al , (1991) 
Bio/Technology 9, 957-962; Peng, et al . (1991) International 
Rice Research Institute, Manila, Philippines 563-574; Cao, et 
al, (1992) Plant Cell Rep, 11, 585-591; Li, et al . (1993) 

10 Plant Cell Rep, 12, 250-255; Rathore, et al . (1993) Plant 
Molecular Biology 21, 871-884; Fromm, et al , (1990) 
Bio/Technology 8, 833-839; Gordon-Kamm, et al , (1990) Plant 
Cell 2, 603-618; D'Halluin, et al , (1992) Plant Cell 4, 1495- 
1505; Walters, et al , (1992) Plant Molecular Biology 18, 189- 

15 200; Koziel, et ai . (1993) Biotechnology 11, 194-200; Vasil, 
I. K. (1994) Plant Molecular Biology 25, 925-937; Weeks, et 
al, (1993) Plant Physiology 102, 1077-1084; Somers, et al . 
(1992) Bio/Technology 10 , 1589-1594; W092/14828) . In 
particular, Agrobacterium mediated transformation i's now 

20 emerging also as an highly efficient alternative 

transformation method in monocots (Hiei et al . (1994) The 
Plant Journal 6, 271-282) . 



The generation of fertile transgenic plants has been achieved 
25 in the cereals rice, maize, wheat, oat, and barley (reviewed 
in Shimamoto, K. (1994) Current Opinion in Biotechnology 5, 
158-162.; Vasil, et al . (1992) Bio/Technology 10, 667-674; 
Vain et al . , 1995, Biotechnology Advances 13 (4) : 653-671; 
Vasil, 1996, Nature Biotechnology 14 page 702). 
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Micropro j ect ile bombardment, electroporat ion and direct DNA 
uptake are preferred where Agrobacterium is inefficient or 
ineffective. Alternatively, a combination of different 
techniques may be employed to enhance the efficiency of the 
5 transformation process, eg bombardment with Agrobacterium 
coated micropart icles (EP-A-486234 ) or micropro j ect ile 
bombardment to induce wounding followed by co-cultivation with 
Agrobacterium (EP-A- 4 86233 ) . 

0 Following transformation, a plant may be regenerated, e.g. 

from single cells, callus tissue or leaf discs, as is standard 
in the art. Almost any plant can be entirely regenerated from 
cells, tissues and organs of the plant. Available techniques 
are reviewed in Vasil et al . , Cell Cultuire and So/natic Cell 

5 Genetics of Plants, Vol J, JX and III, Laboratory Procedures 
and Their Applications , Academic Press, 1984, and Weissbach 
and Weissbach, Methods for Plant Molecular Biology, Academic 
Press, 1989. 



0 The particular choice of a transformation technology will be 
determined by its efficiency to transform certain plant 
species as well as the experience and preference of the person 
practising the invention with a particular methodology of 
choice. It will be apparent to the skilled person that the 

5 particular choice of a transformation system to introduce 
nucleic acid into plant cells is not essential to or a 
limitation of the invention, nor is the choice of technique 
for plant regeneration. 
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A VRN2 gene and modified versions thereof (alleles, mutants, 
variants and derivatives thereof) , and other nucleic acid 
provided herein, including species ho.ologues, may be used to 
modify vernalization response, flowering txme, leaf shape 
5 and/or shade avoidance response in a transgenic plant. 

Nucleic acid such as a vector as described herein may be used 
for the production of a transgenic plant. Such a plant may 
possess an altered phenotype, particular in terms of 
vernalization response, flowering time, leaf shape and/or 
10 shade avoidance response compared with wild-type (that is to 
say a plant that is wild-type for VRN2 or the relevant 
homologue thereof) . 

The invention further encompasses a host cell transformed with 
15 nucleic acid or a vector according to the present invention, 
especially a plant or a microbial cell. Thus, a host cell, 
such as a plant cell, including heterologous nucleic acid 
according to the present invention is provided. Within the 
cell, the nucleic acid may be incorporated withm the 
20 chromosome. There may be more than one heterologous 
nucleotide sequence per haploid genome. 

Also according to the invention there is provided a plant cell 
having incorporated into its genome nucleic acid, particularly 
25 heterologous nucleic acid, as provided by the present 

invention, under operative control of a regulatory sequence 
for control of expression. The coding sequence may be 
operably linked to one or more regulatory sequences which may 
be heterologous or foreign to the gene, such as not naturally 
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associated with the gene for its expression. The nucleic acid 
according to the invention may be placed under the control of 
an externally inducible gene promoter to place expression 
under the control of the user. 

5 

A suitable inducible promoter is the GST- II -27 gene promoter 
which has been shown to be induced by certain chemical 
compounds which can be applied to growing plants. The 
promoter is functional in both monocotyledons and 

0 dicotyledons. It can therefore be used to control gene 
expression in a variety of genetically modified plants, 
including field crops such as canola, sunflower, tobacco, 
sugarbeet, cotton; cereals such as wheat, barley, rice, maize, 
sorghum; fruit such as tomatoes, mangoes, peaches, apples, 

5 pears, strawberries, bananas, and melons; and vegetables such 
as carrot, lettuce, cabbage and onion. The GST- II -27 promoter 
is also suitable for use in a variety of tissues, including 
roots, leaves, stems and reproductive tissues. 

0 A further aspect of the present invention provides a method of 
making such a plant cell involving introduction of nucleic 
acid or a suitable vector including the sequence of 
nucleotides into a plant cell and causing or allowing 
recombination between the vector and the plant cell genome to 

5 introduce the sequence of nucleotides into the genome. The 
invention extends to plant cells containing nucleic acid 
according to the invention as a result of introduction of the 
nucleic acid into an ancestor cell. 
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The term "heterologous" may be used to indicate that the 
gene/sequence of nucleotides xn question have been introduced 
into said cells of the plant or an ancestor thereof, using 
genetic engineering, ie by human intervention. A transgenic 
5 plant cell, i.e. transgenic for the nucleic acid in question, 
may be provided. The transgene may be on an extra-genomic 
vector or incorporated, preferably stably, into the genome. A 
heterologous gene may replace an endogenous equivalent gene, 
ie one which normally performs the same or a similar function, 
10 or the inserted sequence may be additional to the endogenous 
gene or other sequence. An advantage of introduction of a 
heterologous gene is the ability to place expression of a 
sequence under the control of a promoter of choice, in order 
to be able to influence expression according to preference. 
15 Furthermore, mutants, variants and derivatives of the wild- 
type gene, e.g. with higher or lower activity than wild-type, 
may be used in place of the endogenous gene. Nucleic acid 
heterologous, or exogenous or foreign, to a plant cell may be 
non-naturally occuring in cells of that type, variety or 
2 0 species. Thus, nucleic acid may include a coding sequence of 
or derived from a particular type of plant cell or species or 
variety of plant, placed within the context of a plant cell of 
a different type or species or variety of plant. A further 
possibility is for a nucleic acid sequence to be placed within 
25 a cell in which it or a homologue is found naturally, but 

Wherein the nucleic acid sequence is linked and/or adjacent to 
nucleic acid which does not occur naturally within the cell, 
or cells Of that type or species or variety of plant, such as 
operably linked to one or more regulatory sequences, such as a 
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promoter sequence, for control of expression. A sequence 
within a plant or other host cell may be identifiably 
heterologous , exogenous or foreign . 

5 Plants which include a plant cell according to the invention 
are also provided, along with any part or propagule thereof, 
seed, selfed or hybrid progeny and descendants. A plant 
according to the present invention may be one which does not 
breed true in one or more properties. Plant varieties may be 

0 excluded, particularly registrable plant varieties according 
to Plant Breeders' Rights. It is noted that a plant need not 
be considered a "plant variety" simply because it contains 
stably within its genome a transgene, introduced into a cell 
of the plant or an ancestor thereof . 

5 

In addition to a plant, the present invention provides any 
clone of such a plant, seed, selfed or hybrid progeny and 
descendants, and any part of any of these, such as cuttings, 
seed. The invention provides any plant propagule, that is any 

0 part which may be used in reproduction or propagation, sexual 
or asexual, including cuttings, seed and so on. Also 
encompassed by the invention is a plant which is a sexually or 
asexually propagated off-spring, clone or descendant of such a 
plant, or any part or propagule of said plant, off- spring, 

5 clone or descendant . 

The invention further provides a method of influencing or 
affecting a physical characteristic of a plant, particularly 
vernalization response, flowering time, leaf shape and/or 
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Shade avoidance response, including causing or allowing 
expression of a heterologous nucleic acid sequence as 
discussed within cells o£ the plant. 

5 The invention further provides a method of including 

expression fro. nucleic acid encoding a polypeptide, or a 

.utant, variant, allele or derivative of the sequence, or a 
ho^ologue, according to the disclosure herein, within cells of 
a Plant (thereby producing the encoded polypeptide) , following 
10 an earlier step of introduction of the nucleic acid into a 
cell of the Plant or an ancestor thereof. such a method may 
influence or affect a characteristic of the plant, such as 
vernalization response, flowering time, leaf shape and/or 
Shade avoidance response. This may be used in combination 
15 w.th any other gene, such as transgenes involved in flcwerrng 
(e.g. FCA) or other phenotypic trait or desirable property. 



The pres 
o 
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'he present invention also encompasses the expression product 
i any of the nucleic acid sequences drsclosed and methods of 
".aking the expression product by expression from encoding 
nucleic acid therefore under suitable conditions, which may be 
suitable host cells. Pollowing expression, the product may 
be isolated from the expression system and may be used as 
desired, for instance in formulation of a composition 
25 including at least one additional component. 

The present invention also provides for the production and use 
Of fragments of the full-length polypeptides disclosed herein 
especially active portions thereof. >^ -active portion- of J 
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polypeptide means a peptide which is less than said full 
length polypeptide, but which retains an essential biological 
activity. In particular, the active portion retains the 
ability to alter vernalization response, flowering time, leaf 
5 shape and/or shade avoidance response in a plant, such as 
Arabidopsis thaliana . 

A "fragment" of a polypeptide means a stretch of amino acid 
residues of at least about five to seven contiguous amino 

0 acids, often at least about seven to nine contiguous amino 
acids, typically at least about nine to 13 contiguous amino 
acids and, most preferably, at least about 20 to 30 or more 
contiguous amino acids. Fragments of the polypeptides may 
include one or more epitopes useful for raising antibodies to 

5 a portion of any of the amino acid sequences disclosed herein. 
Preferred epitopes are those to which antibodies are able to 
bind specifically, which may be taken to be binding a 
polypeptide or fragment thereof of the invention v/ith an 
affinity which is at least about lOOOx that of other 

0 polypeptides. 

Among preferred VRN2 fragments according to the present 
invention are the zinc finger domain, DNA binding domain and 
other domains disclosed herein. 

5 

Purified protein according to the present invention, or a 
fragment, mutant, derivative or variant thereof, e.g. produced 
recombinantly by expression from encoding nucleic acid 
therefor, may be used to raise antibodies employing techniques 
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whxch are standard in the art. Antibodies and pol^eptides 
comprising ant.gen-binding fragments of antibodies may be used 
in identifying homologues from other species as discussed 
further below, also in identifying complexes containing VRN2 

5 protein. 



Methods of producing antibodies include immunising a mammal 
(e.g. human, mouse, rat, rabbit, horse, goat, sheep or monkey) 
with the protein or a fragment thereof. Antibodies may be 
10 obtained from immunised animals using any of a variety of 

techniques known in the art and m-irrht- k= 

^ arc, and might be screened, preferably 

using binding of antibody to antigen of interest. For 
instance. Western blotting techniques or immunoprecipitation 
may be used (Armitage et al , 1992, Nature 357: 80-82). 
15 Antibodies may be polyclonal or monoclonal. 

AS an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be 
obtained from a recombmantly produced library of expressed 
2 0 immunoglobulin variable domains, e.g. using lambda 

bacteriophage or filamentous bacteriophage which display 
functional immunoglobulin binding domains on their surfaces; 
for instance see WO92/01047. 

25 Antibodies raised to a polypeptide or peptide can be used m 
the identification and/or isolation of homologous 
polypeptides, and then the encoding genes. Thus, the present 
invention provides a method of identifying or isolating a 
polypeptide with the desired function (in accordance with 



embodiments disclosed herein) , comprising screening candidate 
polypeptides with a polypeptide comprising the antigen-binding 
domain of an antibody (for example whole antibody or a 
suitable fragment thereof, e.g. scFv, Fab) which is able to 
bind a polypeptide or fragment, variant or derivative thereof 
according to the present invention or preferably has binding 
specificity for such a polypeptide. Specific binding members 
such as antibodies and polypeptides comprising antigen binding 
domains of antibodies that bind and are preferably specific 
for a polypeptide or mutant, variant or derivative thereof 
according to the invention represent further aspects of the 
present invention, particularly in isolated and/or purified 
form, as do their use and methods which employ them. 

Candidate polypeptides for screening may for instance be the 
products of an expression library created using nucleic acid 
derived from an plant of interest, or may be the product of a 
purification process from a natural source. A polypeptide 
found to bind the antibody may be isolated and then may be 
subject to amino acid sequencing. Any suitable technique may 
be used to sequence the polypeptide either wholly or partially 
(for insta-nce a fragment of the polypeptide may be sequenced) . 
Amino acid sequence information may be used in obtaining 
nucleic acid encoding the polypeptide, for instance by 
designing one or more oligonucleotides (e.g. a degenerate pool 
of oligonucleotides) for use as probes or primers in 
hybridization to candidate nucleic acid, or by searching 
computer sequence databases, as discussed further below. 
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A further a.pect of the present invention provides a method of 
Identifying end cloning ho.ologues fro. plant species 

other than .raMdopsis tiaiiana which method employs a 
nucleotide sequence derived from that sho™ herein Ss 
S discussed above, sequences derived fro™ these may themselves 
be used in rdentifyrng and in olonxng ether sequences. The 
nucleotide sequence information provided herein, or any part 
thereof, may he used in a data-base search to find homologous 
sequences, expression products of which can be tested for 
- ability to influence a plant charaoterrstic . xh.se may have 
abrlity to alter vernali.atron response, flowering time, leaf 
shape and/or shade avoidance response in a plant. 

Alternatively, nucleir sr-i^ i '-u 

y, nucleic acid libraries may be screened using 

techniques well known to those skilled in the art and 
5 homologous sequences thereby identified then' tested. 

The present invention also extends to nucleic acid encoding a 

homologue obtained using a nucleotide sequence derived 
from any of those shown herein. 

0 

in certain embodiments, nucleic acid according to the present 
.nvention encodes a polypeptide which has homology with all or 
part Of v™. amino acid sequence shown herein, in the terms 

discussed already abovp la. ^ f 

ay above (e.g. for length), which homology is 

greater over the length of the relevant part (i.e. fragment) 

than the homology shared between a respective part of the 

amino acid sequence of Ar.,Mopsls t,.U.n., and the other 

sequences shown in Fian-r^ q = 

n Figure 8a or Figure 8b, and may be greater 

than about 5% qreater mr.^^ 

g eater, more preferably greater than about 10% 
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greater, more preferably greater than about 20% greater, and 
more preferably greater than about 30% greater. Thus, to 
exemplify with reference to one embodiment, nucleic acid 
encoding an amino acid mutant, variant or derivative of the 
5 amino acid sequence shown in SEQ ID NO: 2 may be provided 

wherein the encoded amino acid sequence includes a contiquous 
sequence of about 100 amino acids which has greater homology 
with a contiguous sequence of 100 amino acids within the amino 
acid sequence of SEQ ID NO: 2 than any contiguous sequence of 
10 100 amino acids within another sequence shown in Figure 8a or 
8b, preferably greater than about 5% greater homology, and so 
on . 



Similarly, nucleic acid according to certain embodiments of 
15 the present invention may have homology with all or part of a 
nucleotide sequence shown herein, in the terms discussed 
already above (e.g. for length), which homology is greater 
over the length of the relevant part (i.e, fragment) than the 
homology shared with a respective part of the natural coding 
20 nucleotide sequence for the other amino acid sequences shown 
in Figure 8a or 8b and referenced herein, and may be greater 
than about 5% greater, more preferably greater than about 10% 
greater, more preferably greater than about 20% greater, and 
more preferably greater than about 30% greater. 

25 

The provision of sequence information for the VRN2 gene of 
Arabidopsis thaliana. enables the obtention of homologous 
sequences from other plant species. In particular, 
homologues may be easily isolated from related, commercially 
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important species that have a vernalization requirement, or 
Show some response to vernalization. These would include all 
members of the Brassicaceae , and other dicots including 
tobacco, sugarbeet, peas and celery. Monocots included in 
5 this category are the cereals rice, wheat and barley. 

Thus, included within the scope of the present invention are 
nucleic acid molecules which encode amino acid sequences which 
are homologues of vm2 of Aratidopsls thaliana. Homology may 
10 be at the nucleotide sequence and/or amino acid sequence 

level, as has already been discussed above. A homologue from 
a species other than AratMopsis thaliana encodes a product 
which causes a phenotype similar to that caused "by the VRN2 
gene, generally including ability to alter vernalization 
15 response, flowering time, leaf shape and/or shade avoidance 
response in a plant, such as in Arabidopsis thaliana. in 
addition, mutants, derivatives or alleles of these genes may 
have altered, e.g. increased or decreased, activity or ability 
compared with wild-type. 
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VRN2 gene homologues may also be identified from economically 
important monocotyledonous crop plants including the cereals 
rice, wheat and barley. Although genes encoding the same 
protein in monocotyledonous and dicotyledonous plants show 
25 relatively little homology at the nucleotide level, amino acid 
sequences are conserved. Therefore it is possible to use 
public sequence databases to identify Aratidopsis, rice or 
maize cDNA clone sequences that were obtained in random 
sequencing programmes and share homology to the gene of 



interest, as has been done for other genes isolated from 
Arabidopsis (e.g CO; WO 96/14414). Of course, mutants, 
derivatives and alleles of these sequences are included within 
the scope of the present invention in the same terms as 
discussed above for the Arabidopsis thaliana VRN2 gene. 

According to a further aspect, the present invention provides 
a method of identifying or a method of cloning a VRN2 
homologue, e.g. from a species other than Arabidopsis thaliana 
the method employing a nucleotide sequence derived from any of 
those shown herein. For instance, such a method may employ an 
oligonucleotide or oligonucleotides which comprises or 
consists of a sequence or sequences conserved between or 
encoding a sequence or sequences conserved between the 
sequences shown in Figure 8a or 8b, or a sequence or sequences 
conserved between the sequences of SEQ ID NO: 2 and SEQ ID NO: 
5, or encoding sequences SEQ ID NO-: 1 and SEQ ID NO : 4, to 
search for homologues. Thus, a method of obtaining nucleic 
acid is provided, comprising hybridisation of an 
oligonucleotide or a nucleic acid molecule comprising such an 
oligonucleotide to target /candidate nucleic acid. Target or 
candidate nucleic acid may, for example, comprise a genomic or 
cDNA library obtainable from an organism known to contain or 
suspected of containing such nucleic acid, either 
monocotyledonous or dicotyledonous. Successful hybridisation 
may be identified and target /candidate nucleic acid isolated 
for further investigation and/or use. 

Hybridisation may involve probing nucleic acid and identifying 
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positive hybridisation under suitably stringent conditions (in 
accordance with known techniques) and/or use of 
oligonucleotides as primers in a method of nucleic acid 
amplification, such as PGR. For probing, preferred conditions 
5 are those which are stringent enough for there to be a simple 
pattern with a small number of hybridisations identified as 
positive which can be investigated further. It is well known 
in the art to increase stringency of hybridisation gradually 
until only a few positive clones remain. 

10 

For instance, screening may initially be carried out under 
conditions, which comprise a temperature of about 37°C or 
more, a formamide concentration of less than about 50%, and a 
moderate to low salt (e.g. Standard Saline Citrate ('SSC') = 
15 0.15 M sodium chloride; 0 . 15 M sodium citrate; pH 7) 
concentration . 

Alternatively, a temperature of about 50°C or more and a high 
salt (e.g. ^SSPE'= 0.180 M sodium chloride; 9 mM disodium 

2 0 hydrogen phosphate; 9 mM sodium dihydrogen phosphate; 1 mM 

sodium EDTA; pH 7.4). Preferably the screening is carried out 
at about 37°C, a formamide concentration of about 20%, and a 
salt concentration of about 5 X SSC, or a temperature of about 
50°C and a salt concentration of about 2 X SSPE. These 

25 conditions will allow the identification of sequences which 
have a substantial degree of homology (similarity, identity) 
with the probe sequence, without requiring the perfect 
homology for the identification of a stable hybrid. 



Suitable conditions include, e.g. for detection of sequences 
that are about 80-90% identical, hybridization overnight at 
42<=C in 0.25M Na2HP04 , pH 7.2, 6.5% SDS , 10% dextran sulfate, 
and a final wash at 55°C in 0 . IX SSC, 0.1% SDS . For detection 
of sequences that are greater than about 90% identical, 
suitable conditions include hybridization overnight at 65°C in 
0.25M NasHPO^, pH 7.2, 6.5% SDS, 10% dextran sulfate and a 
final wash at 60°C in 0 . IX SSC, 0.1% SDS. 

An alternative is a solution of 5x SSPE (final 0.9 M NaCl , 
0.05M sodium phosphate, 0.005M ethylenediamnetetraacet ic acid 
(EDTA) pH 7.7), 5X Denhardt ' s solution, 0.5% SDS (sodium 
dodecyl sulphate), at 65'C overnight, (for high stringency, 
highly similar sequences) or 50 'C (for low stringency, less 
similar sequences). Washes in 0 . 2x SSC/0.1% SDS at 65"C for 
high stringency, alternatively at 50-60 'C in Ix SSC/0.1%SDS 
for low stringency. 

The present invention extends to nucleic acid selectively 
hybridisable under high stringency with nucleic acid 
identified herein . 

As an alternative to probing, though still employing nucleic 
acid hybridisation, oligonucleotides designed to amplify DNA 
sequences may be used in PGR reactions or other methods 
involving amplification of nucleic acid, using routine 
procedures. See for instance "PGR protocols; A Guide to 
Methods and Applications", Eds. Innis et ai , 1990, Academic 
Press, New York. 



Preferred amino acid sequences suitable for use in the design 
Of probes or PGR primers for some purposes are sequences 
conserved (completely, substantially or partly) between the 
VRN2 sequence and at least one other of the sequences shown in 

5 Figure 8a or 8b. 



Preferred primers for amplification of conserved regions of 
VRN2 for use as probes to obtain genomic or cDNA clones may 
include the following: 

10 

Primers VRN2-AI and VRN2-AJ which, in RT-PCR, amplify a 1583 
bp fragment that contains the complete VRN2 open reading 
frame, and portions of both the 5' and 3' untranslated 

sequences ; 

15 

Primers VRN2-AP and VRN2-AJ wh.ch, in RT-PCR, amplify a 781 bp 
fragment that includes the conserved acidic region; 

Primers VRN2-A0 and VRN2-AS which, in RT-PCR, amplify a 493 bp 
'.0 fragment that includes the zinc-finger motif, and the second 
nuclear localization signal (NLS) ; and 

Pramers VRN2-AI and VRN2-AJ from genomic DNA, which amplify a 
3605 bp product that includes most of the VRN2 gene, except 
5 the promoter and 3' regions (i.e. encompasses the same regions 
as the VR.2-AI/A. pair above, but with the introns, useful for 
hybridisation to genomic DNA, less so for cDNA) . 



On the basis of amino acid sequence informati 



on 
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oligonucleotide probes or primers may be designed, taking into 
account the degeneracy of the genetic code, and, where 
appropriate, codon usage of the organism from the candidate 
nucleic acid is derived. 



Preferably an oligonucleotide in accordance with certain 
embodiments of the invention, e.g. for use in nucleic acid 
amplification, is up to about 50 nucleotides, or about 40 
nucleotides or about 30 or fewer nucleotides in length (e.g. 
10 18, 21 or 24) . 

Assessment of whether or not such a PGR product corresponds to 
a homologue gene may be conducted in various ways . A PGR band 
from such a reaction might contain a complex mix of products. 
15 Individual products may be cloned and each one individually 
screened. It may be analysed by transformation to assess 
function on introduction into a plant of interest. 

As noted, nucleic acid according to the present invention is 
20 obtainable using oligonucleotides, designed on the basis of 
sequence information provided herein, as probes or primers. 
Nucleic acid isolated and/or purified from one or more cells 
of a plant (see above) , or a nucleic acid library derived from 
nucleic acid isolated and/or purified from the plant (e.g. a 
25 cDNA library derived from mRNA isolated from the plant) , may 
be probed under conditions for selective hybridisation and/or 
subjected to a specific nucleic acid amplification reaction 
such as the polymerase chain reaction (PGR) . The nucleic acid 
probed or used as template in the amplification reaction may 



5 
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be genomic DNA. cDNA or ^A. If necessary, one or ™ore gene 
fragments ™ay be ligated to generate a full-length coding 

sequence . 

5 PGR primers derived from thp vt?mo 

^rom tne VRN2 sequences disclosed herein 

™ay readily be tested for their specificity for amplifying 
nucleic acid accordrng to the present invention, using both 

genomic DMA and RT-Prp i-^rr. i ^ 

and RT PGR templates. Cloning and subsequent 

sequencing of PCR products may be used to indicate 

10 -Plification Of the expected derived gene fragment. Full 
length cDNA clones can be obtained as described by 5 ' and 3 ' 
RACE technology if rt-pcr products are used as templates. 

Various aspects of the present invention include the 
15 Obtainable nucleic acid, methods of screening material e g 
cell lysate, nucleic acid preparations, for the presence of 
nucleic acid of interest, methods of obtaining the nucleic 
acid, and suitable primers and primer combinations. 

20 The sequence information provided herein also allows the 

design of diagnostic tests for determination of the presence 
Of a specific gene or allele thereof in any given plant, 
cultivar, variety, population, landrace, part of a family or 
other selection in a breeding programme or other such 

25 genotype. ^ diagnostic test may be based on determination of 
the presence or absence of a particular allele by means of 
nucleic acid or polypeptide determination. 



At the nucleic acid level, thi 



s 



may involve hybridisation of 



a 



suitable oligo- or poly-nucleot ide , such as a fragment of the 
gene or a homologue thereof, including any homologue disclosed 
herein, or any particular allele, such as an allele, which 
gives a desirable phenotype, such as any such allele disclosed 
herein. The hybridisation may involve PGR designed to amplify 
a product from a given allelic version of the gene, with 
subsequent detection of an amplified product by any of a 
number of possible methods including but not limited to gel 
electrophoresis, capillary electrophoresis, direct 
hybridisation of nucleotide sequence probes and so on. A 
diagnostic test may be based on PGR designed to amplify 
various alleles or any allele from the relevant locus, with a 
test to distinguish the different possible alleles by any of a 
number of possible methods, including DNA fragment size, 
restriction site variation (e.g. GAPS - cleaved amplified 
polymorphic sites) and so on. A diagnostic test may also be 
based on a great number of possible variants of nucleic acid 
analysis that will be apparent to those skilled in the art, 
such as use of a synthetic sequence as a hybridisation probe. 

Broadly, the methods divide into those screening for the 
presence of nucleic acid sequences and those that rely on 
detecting the presence or absence of a polypeptide. The 
methods may make use of biological samples from one or more 
plants or cells that are suspected -to contain the nucleic acid 
sequences or polypeptide. 

Exemplary approaches for detecting nucleic acid or 
polypeptides include analysing a sample from the plant or 
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plant cell by: 

(a) comparing the sequence of nucleic acid in the sample with 
all or part of a nucleotide sequence shown herein, to 

5 determine whether the sample contains a mutation; 

(b) determining the presence in the sample of a polypeptide 
including a VRN2 amino acid sequence shown herein, or a 
fragment thereof and, if present, determining whether the 

10 polypeptide is full length, and/or is mutated^ and/or is 
expressed at the normal level; 

(c) performing DNA fingerprinting to compare the restriction 
pattern produced when a restriction enzyme cuts nucleic acid 

15 in the sample with the restriction pattern obtained from a 
nucleotide sequence shown herein, or from a known mutant, 
allele or variant thereof; 

(d) contacting the sample with a specific binding member 

20 capable of binding to nucleic acid including the nucleotide 
sequence as set out herein, or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 
including nucleic acid hybridisable with a VRN2 sequence 
herein, or a polypeptide including a binding domain with 

25 specificity for nucleic acid including a VRN2 sequence or 
polypeptide encoded by it, or a mutated form thereof, and 
determining binding of the specific binding member; 



(e) performing PGR involving one or more primers based on a 



m 



0^ 
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nucleot J.de sequence shown herein to screen the sample for 
nucleic acid including the nucleotide sequence of SEQ ID NO: 1 
or SEQ ID NO: 4 or a mutant, allele or variant thereof. 

5 When screening for a VRN2 allele nucleic acid, the nucleic 
acid in the sample will initially be amplified, e.g. using 
PGR, to increase the amount of the analyte as compared to 
other sequences present in the sample. This allows the target 
sequences to be detected with a high degree of sensitivity if 

0 they are present in the sample. This initial step may be 
avoided by using highly sensitive array techniques that are 
becoming increasingly important in the art . 

A variant form of the gene may contain one or more insertions, 
5 deletions, substitutions and/or additions of one or more 

nucleotides compared with the wild-type sequence which may or 
may not disrupt or alter the gene function. Differences at 
the nucleic acid level are not necessari ly reflected by a 
difference in the amino acid sequence of the encoded 
0 polypeptide . However, a mutation or other difference in a 
gene may result in a frame- shift or stop codon, which could 
seriously affect the nature of the polypeptide produced (if 
any) , or a point mutation or gross mutational change to the 
encoded polypeptide, including insertion, deletion, 
5 substitution and/or addition of one or more amino acids or 

regions in the polypeptide, A mutation in a promoter sequence 
or other regulatory region may prevent or reduce expression 
from the gene or affect the processing or stability of the 
mRNA transcript , 
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Lons 



Te..3 .ay be «..ie. ou^ on preparations containing genomic 

Of the complexity of the nucleic acid h« ■ 

u^-xeic acid being reduced by the 

absence of intron sequences hnh 

quences, but the possible disadvantage of 
5 extra time and effm-h 

^^'"9 required in making the 

preparations. rna is more difficult ^o n. • . 

uirricult to manipulate than DNA 

because of the wide-spread occurrence of ^.ases, 

N-leic acid in a test sample .ay be sequenced and the 
- se^ence compared with a sequence shown herein to determine 
Whether or not a difference is present. „ so, the difference 
can he compared with .nown aUeles to determine whether the 
test nucleic acid contains one or .ore of the variati, 

indicated, or thf^ h-i-f^^ 

the difference can be investigated for 

15 association with a desired phenotype. 

The amplified nucleic acid .ay then be sequenced as above 
and/or tested in any other way to determine the presence Ir 
absence of a partrcular feature. »ucle.c a„d for testing .ay 
be prepared fro. nucleic acid re.oved fro. cells or in a 
library using a variety of other techniques such as 
restriction en.y.e digest and electrophoresis. 

nucleic acid .ay be screened using a variant- or allele 
=5 specific probe. Such a probe corresponds in sequence to a 
region of the qene or- -ii-^ 

g^ne, rts complement, containing a sequence 

alteration .nown to be associated with alteration of ability 
to affect vernali.ation response, flowering time, leaf shape 
and/or shade avoidance response. Under suitably stringent 
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conditions, specific hybridisation of such a probe to test 
nucleic acid is indicative of the presence of the sequence 
alteration in the test nucleic acid. For efficient screening 
purposes, more than one probe may be used on the same test 
5 sample. 

Allele- or variant-specific oligonucleotides may similarly be 
used in PGR to specifically amplify particular sequences if 
present in a test sample. Assessment of whether a PGR band 

0 contains a gene variant may be carried out in a number of ways 
familiar to those skilled in the art. The PGR product may for 
instance be treated in a way that enables one to display the 
mutation or polymorphism on a denaturing polyacrylamide DNA 
sequencing gel, with specific bands that are linked to the 

5 gene variants being selected. 

An alternative or supplement to looking for the presence of 
variant sequences in a test sample is to look for the presence 
of the normal sequence, e.g. using a suitably specific 
0 oligonucleotide probe or primer. 

Approaches which rely on hybridisation between a probe and 
test nucleic acid and subsequent detection of a mismatch may 
be employed. Under appropriate conditions (temperature, pH 
5 etc.), an oligonucleotide probe will hybridise with a sequence 
which is not entirely complementary. The degree of base- 
pairing between the two molecules will be sufficient for them 
to anneal despite a mis-match. Various approaches are well 
known in the art for detecting the presence of a mis-match 
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between two annealing nucleic acid molecules. 

For instance, RN'ase A cleaves at the site of a mis-match. 
Cleavage can be detected by electrophoresing test nucleic acid 
5 to which the relevant probe or probe has annealed and looking 
for smaller molecules (i.e. molecules with higher 
electrophoretic mobility) than the full length probe/test 
hybrid. Other approaches rely on the use of enzymes such as 
resolvases or endonucleases . 

10 

Thus, an oligonucleotide probe that has the sequence of a 
region of the normal gene (either sense or anti-sense strand) 
in which mutations associated with particular phenotypes are 
known to occur may be annealed to test nucleic acid and the 

15 presence or absence of a mis-match determined. Detection of 
the presence of a mis-match may indicate the presence in the 
test nucleic acid of a mutation. On the other hand, an 
oligonucleotide probe that has the sequence of a region of the 
gene including a mutation may be annealed to test nucleic acid 

20 and the presence or absence of a mis-match determined. The 

presence of a mis-match may indicate that the nucleic acid in 
the test sample has the normal sequence, or a different mutant 
or allele sequence. In either case, a battery of probes to 
different regions of the gene may be employed. 

25 

The presence of differences in sequence of nucleic acid 
molecules may be detected by means of restriction enzyme 
digestion, such as in a method of DNA fingerprinting where the 
restriction pattern produced when one or more restriction 
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enzymes are used to cut a sample of nucleic acid is compared 
with the pattern obtained when a sample containing the normal 
gene or a variant or allele is digested with the same enzyme 
or enzymes . 

The presence of absence of a lesion in a promoter or other 
regulatory sequence may also be assessed by determining the 
level of mRNA production by transcription or the level of 
polypeptide production by translation from the mRNA . 



Nucleic acid isolated and/or purified from one or more cells 
of a plant or a nucleic acid library derived from nucleic acid 
isolated and/or purified from cells (e.g. a cDNA library 
derived from mRNA isolated from the cells) , may be probed 
15 under conditions for selective hybridisation and/or subjected 
to a specific nucleic acid amplification reaction such as the 
polymerase chain reaction (PCR) . 



A method may include hybridisation of one or more (e.g. two) 
20 probes or primers to target nucleic acid. Where the nucleic 
acid is double-stranded DNA, hybridisation will generally be 
preceded by denaturation to produce single-stranded DNA. The 
hybridisation may be as part of a PCR procedure, or as part of 
a probing procedure not involving PCR. An example procedure 
25 would be a combination of PCR and low stringency 

hybridisation. A screening procedure, chosen from the many 
available to those skilled in the art, is used to identify 
successful hybridisation events and isolate hybridised nucleic 
acid . 
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-easu^ed „3.„, 3 .„,e., of .e..„.,.es a. t.e disposal 

of those skilled in the art c 

the art. For instance, probes may be 

radioactively, f luoreqppnt- 1 

luorescently or enzymaticaUy labelled, other 

S .ethod. not e„plo,l„, labelUn, of probe Include examination 
Of restriction fragment length pol^orphls^s, amplification 

using PGR, RNAase cleavaqe and ^^n^i 

age and allele specific oligonucleotide 

probing. 

" P.oMng „a. employ the standard Southern Mottlng teohnl^e 
For instance 0» may he extracted from cells and digested .1th 
.fferent restriction enzymes. Restriction fragments may then 
W electrophoresis on an agarose ,el , before 
denaturation and transfer to a nltrooell„lose filter 
- Labelled probe may be hybridised to the 0» fragments on the 
f.Uer and binding determined. MA for probing may be 

prepared from RNA urBr^^-r;,^ ^ ^ 

preparations from cells. 

PreUm.„ary experrments may be performed by hybrldisrng under 
low stringency conditions various probes to Southern blots of 

" -strlctlon enzymes. Suitable conditions 

would be achieved when a 1=^„. 

a large number of hybridising fragments 

were Obtained while the background hybrrdisat.on was low 
using these conditions nucleic acid libraries, e.g. co« ^ 
irbrarles representative of expressed sequences, may be 
searched. 

As noted, those skilled in hv,= 

^lled m the art are well able to employ 

suitable conditions of the desi r^H • 

^ne desired stringency for selective 



hybridisation, taking into account factors such as 
oligonucleotide length and base composition, temperature and 
so on . 

In some preferred embodiments of diagnostic assays according 
to the present invention, oligonucleotides according to the 
present invention that are fragments of any of the sequences 
shown herein, or any allele associated with a desired 
phenotype are at least about 10 nucleotides in length, more 
preferably at least about 15 nucleotides in length, more 
preferably at least about 2 0 nucleotides in length, more 
preferably about 30 nucleotides in length. Such fragments 
themselves individually represent aspects of the present 
invention. Fragments and other oligonucleotides may be used 
as primers or probes as discussed but may also be generated 
(e.g. by PGR) in methods concerned with determining the 
presence in a test sample of a sequence indicative of a 
desired phenotype . 

There are various methods for determining the presence or 
absence in a test sample of a particular polypeptide, such as 
a polypeptide including the amino acid sequence shown in SEQ 
ID NO: 2 or SEQ ID NO: 5, or an amino acid sequence mutant, 
variant or allele thereof. 

A sample may be tested for the presence of a binding partner 
for a specific binding member such as an antibody (or mixture 
of antibodies) , specific for one or more particular variants 



of a polypeptide shown herei 



in such cases, the sample ™y be tested by being contacted 
with a specific binding ,„e»ber such as an antibody under 
5 appropriate conditions for specific binding, before bindrng i 
determined, for instance using a reporter system as discussed 
Where a panel of antibodies is used, different reporting 
labels may be employed for each antibody so that binding of 
each can be determined. 
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A specific binding member such as an antibody may be used to 
isolate and/or purity its binding partner polypeptide from a 
test sample, to allow for se,r.ence and/or biochemical analysis 
Of the polypeptide to determine whether it has the sequence 
15 and/or properties of the wild-type polypeptide or a particular 
mutant, variant or allele thereof. Amino acid sequence is 
routine in the art using automated sequencing machines. 



0 The use of diagnostic tests for alleles allows the researcher 
or plant breeder to establish, with full confidence and 
independent from time consuming brochemical tests, whether or 
not a desired allele is present in tho plant of interest <or a 
oell thereof) , whether the plant is a representative of a 

5 collection of other genetically identical plants (e.g. an 
inbred variety or cultivar, or one individual in a sample of 
related (e.g. breeders' selection) or unrelated plants. 

in a breeding scheme based on selection and selfing of 
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desirable individuals, nucleic acid or polypeptide diagnostics 
for the desirable allele or alleles in high throughput, low 
cost assays as provided by this invention, reliable selection 
for the can be made at early generations and on more material 
5 than would otherwise be possible. This gain in reliability of 
selection plus the time saving by being able to test material 
earlier and without costly phenotype screening is of 
considerable value in plant breeding. 



10 Nucleic acid-based determination of the presence or absence of 
one or more desirable alleles may be combined with 
determination of the genotype of the flanking linked genomic 
DNA and other unlinked genomic DNA using established sets of 
markers such as RFLPs, microsatellites or SSRs, AFLPs, RAPDs 

15 etc. This enables the researcher or plant breeder to select 

for not only the presence of the desirable allele but also for 
individual plant or families of plants which have the most 
desirable conobinat i ons of linked and unlinked genetic 
background. ■ Such recombinations of desirable material may 

2 0 occur only rarely within a given segregating breeding 

population or backcross progeny. Direct assay of the locus as 
afforded by the present invention allows the researcher to 
make a stepwise approach to fixing (making homozygous) the 
desired combination of flanking markers and alleles, by first 

25 identifying individuals fixed for one flanking marker and then 
identifying progeny fixed on the other side of the locus all 
the time knowing with confidence that the desirable allele is 
still present. 



•I 
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The present disclosure provides sufficient information for a 
person s.iUed in the art to obtain genomic DMA sequence for 

any given new or exis^■ino =,11^1-, ^ ■, 

existing allele and devise a suitable nucleic 

acid- and/or polypeptide -based diagnostic assay. in designing 
5 a nucleic acid assay account is taken of the distinctive 
variation in sequence that characterises the particular 
variant allele. 



Nucleic acid according to the invention may include a 
10 nucleotide sequence encoding a product involved i„ 

vernalization response, flowering time, leaf shape and/or 
Shade avoidance response. Reducing or increasing the level of 
expression may be used to manipulate such a characteristic in 
a Plant. This may involve use of anti-sense or sense 
15 regulation, discussed further below. 

Hucleic acid according to the invention, such as a gene 
or homologue, may be placed under the control of an e.ternallv 
inducrble gene promoter to place expression under the control 
=0 Of the user. An advantage of introduction of a heterologous 
gene into a plant cell, particularly when the cell is 
comprised in a plant, is the ability to place expression of 
the gene under the control of a promoter of choice, in order 
to be able to influence gene expression, and therefore 
2S vernauzation response, flowering time, leaf shape, shade 

avoidance response, and/or other characteristic, according to 
preference. Furthermore, mutants and derivatives of the 
Wild-type gene, e.g. with higher or lower activity than 
Wild-type, may be used in place of the endogenous gene 




In the present invention, over-expression may be achieved by- 
introduction of the nucleotide sequence in a sense 
orientation. Thus, the present invention provides a method of 
influencing a physical characteristic of a plant, the method 
5 including causing or allowing expression of the product 
(polypeptide or nucleic acid transcript) encoded by 

I 

heterologous nucleic acid according to the invention from that 
nucleic acid within cells of the plant. 

10 Down-regulation of expression of a target gene may be achieved 
.using anti-sense technology or "sense regulation" ("co- 
suppression" ) . 

In using anti-sense genes or partial gene sequences to 
15 down-regulate gene expression, a nucleotide sequence is placed 
under the control of a promoter in a "reverse orientation" 
such that transcription yields RNA which is complementary to 
normal mRNA transcribed from the "sense" strand of the target 
gene. See, for example, Rothstein et al, 1987; Smith et 
20 al, (1988) Nature 334, 724-726; Zhang et al, (1992) The Plant 
Cell 4, 1575-1588, English et al . , (1996) The Plant Cell 8, 
179-188. Antisense technology is also reviewed in Bourque , 
(1995), Plant Science 105, 125-149, and Flavell, (1994) PNAS 
USA 91, 3490-3496. 

25 

An alternative is to use a copy of all or part of the target 
gene inserted in sense, that is the same, orientation as the 
target gene, to achieve reduction in expression of the target 
gene by co- suppression . See, for example, van der Krol et 
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al., (1990) The Plant Cell 2, 291-299; Napoli et al , , (1990) 
The Plant Cell 2, 279-289; Zhang et al . , (1992) The Plant Cell 
4, 1575-1588, and US-A- 5 , 231 , 020 . 

5 The complete sequence corresponding to the coding sequence (in 
reverse orientation for anti-sense) need not be used. For 
example fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
fragments of various sizes and from various parts of the 

10 coding sequence to optimise the level of anti-sense 

inhibition. It may be advantageous to include the initiating 
methionine ATG codon, and perhaps one or more nucleotides 
upstream of the initiating codon. A further possibility is to 
target a conserved sequence of a gene, e.g. a sequence that is 

15 characteristic of one or more genes, such as a regulatory 
sequence . 

The sequence employed may be about 500 nucleotides or less, . 
possibly about 400 nucleotides, about 300 nucleotides, about 

20 200 nucleotides, or about 100 nucleotides. It may be possible 
to use oligonucleotides of much shorter lengths, 14-23 
nucleotides, although longer fragments, and generally even 
longer than about 500 nucleotides are preferable where 
possible, such as longer than about 600 nucleotides, than 

25 about 700 nucleotides, than about 800 nucleotides, than about 
1000 nucleotides or more. 

It may be preferable that there is complete sequence identity 
in the sequence used for down-regulation of expression of a 



target sequence, and the target sequence, though total 
complementarity or similarity of sequence is not essential. 
One or more nucleotides may differ in the sequence used from 
the target gene. Thus, a sequence employed in a down- 
5 regulation of gene expression in accordance with the present 
invention may be a wild-type sequence (e.g. gene) selected 
from those available, or a mutant, derivative, variant or 
allele, by way of insertion, addition, deletion or 
substitution of one or more nucleotides, of such a sequence. 

10 The sequence need not include an open reading frame or specify 
an RNA that would be translatable. It may be preferred for 
there to be sufficient homology for the respective anti-sense 
and sense RNA molecules to hybridise. There may be down 
regulation of gene expression even where there is about 5%, 

15 10%, 15% or 20% or more mismatch between the sequence used and 
the target gene. 



Generally, the transcribed nucleic acid may represent a 
fragment of a gene, or the complement thereof, or may be a 

20 mutant, derivative, variant or allele thereof, in similar 

terms as discussed above in relation to alterations being made 
to a coding sequence and the homology of the altered sequence. 
The homology may be sufficient for the transcribed ant i- sense 
RNA to hybridise with nucleic acid within cells of the plant, 

25 though irrespective of whether hybridisation takes place the 
desired effect is down-regulation of gene expression. 



Thus, the present invention also provides a method of 
modifying, affecting, altering or modulating a characteristic 
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of a plant, e.g. vernalization response, flowering time, leaf 
shape and/or shade avoidance response, the method including 
causing or allowing anti-sense transcription from heterologous 
nucleic acid according to the invention within cells of the 
5 plant . 

The present invention further provides the use of the 
nucleotide sequence of vm2, or a fragment, mutant, 
derivative, allele, variant or homologue thereof for down- 
10 regulation of gene expression, particularly down-regulation of 
expression of a gene or homologue thereof, preferably in 

order to influence a physical characteristic of a plant, 
especially vernalization response, flowering time, leaf shape 
and/or shade avoidance response. 

15 

When additional copies of the target gene are inserted in 
sense, that is the same, orientation as the target gene, a 
range of phenotypes is produced which includes individuals 
where over-expression occurs and some where under-expression 

20 of protein from the target gene occurs. When the inserted 
gene is only part of the endogenous gene the number of 
under-expressing individuals in the transgenic population 
increases. The mechanism by which sense regulation occurs, 
particularly down-regulation, is not well-understood. 

25 However, this technique is also well -reported in scientific 
and patent literature and is used routinely for gene control. 
See, for example, van der Krol et al . , (1990) The Plant Cell 
2, 291-229; Napoli et al . , (1990) The Plant Cell 2, 279-289; 
Zhang et al , 1992 The Plant Cell 4, 1575-1588. 
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Again, fragments, mutants and so cn may be used in similar 
terms as described above for use in anti-sense regulation. 



Thus, the present invention also provides a method of 
5 influencing a characteristic of a plant, e.g. vernalization 
response, flowering time, leaf shape and/'or shade avoidance 
response, the method including causing or allowing expression 
from nucleic acid according to the invention within cells of 
the plant. This may be used to suppress activity of a product 
10 with ability to influence vernalization response, flowering 
time, leaf shape and/or shade avoidance response. Here the 
activity of the product is preferably suppressed as a result 
of under-expression within the plant cells. 



15 

Aspects and embodiments of the present invention will now be 
illustrated, by way of example, with reference to the 
accompanying fagux^es. Further aspects and embodiments will be 
apparent to those skilled in the art. All documents mentioned 
20 in this text are incorporated herein by reference. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows total leaf number (Rosette plus Cauline) of Ler 
25 plants (squares) , jfca-I plants (diamonds) and vjrn2-I fca-I 
plants (circles) after various periods of vernalization 
(measured in days) . 



Figure 2 illustrates the Red Far-Red Light plant phenotype : 
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Figure .A Shows the area of the largest laa£ of .er fca 

bars) or white light with supplementary PR light („ . PR, 

(shaded bars) . 

P.9ure .B shows the rosette leaf nu^er at bolting of 

plants treated i -m ^-u 

as an the experxment of whxch results are shown 

m Figure 2A. 

Figure 3 illustrates results of genetio an. physical mapping 
10 Of VRN2 in ;iral,i dopsi s thaliana. 

Figure 4 illustrates oosmids and genes near in the 

Arabidopsis genome. 

" Figure S illustrates the structure of the gene of 

.ra.idopsis, the » cDM. aberrant splicing, and the position 
and nature of the .rn.-l „„tation. Bxons are shown as open 
boxes. Unstranslated regions are shown as shaded boxes. 
Introns are shown as lines. 

20 

Figure . Shows the sequence of v™. oDHA, including coding 
sequence and predicted amino acid sequence of the encoded 
protein. Putative NLSs are boxed, the putative acidic 
aotivat.on domain is underlined. The putative .inc-finger 

25 motif is doubly underlinf^ri tv,^ • ■ 

aerimed. The positions of introns are 

indicated with arrow=! tv,^ • • 

arrows. The position of the vrn2-l mutation is 

circled. 



Figure 7 illustrate 



s dCAPS Marker for the .rn2-l Mutation, 



diagnostic derived CAPS (dCAPS) marker was designated for the 
vrn2-l mutation. This utilizes a primer (VRN2-AZ) that 
includes half of the recognition site for the XmnI restriction 
enzyme, the other half is supplied, specifically, by the 
sequence of the \Arn2-I mutation. This results in a successful 
restriction digestion only when using PGR amplified genomic 
DNA from vrn2~l mutants as a template. 

Figure 8 shows alignment of the Arabidopsis VRN2 amino acid 
sequence with similar proteins. 

Figure 8A aligns the full-length VRN2 protein with four 
other proteins, using Clustal method with PAJy[250 residue 
weight table, performed on 17 January 1999 at 19:19 GMT. 

Figure SB aligns the zinc finger region, using Clustal 
method with PAJyi250 residue weight table, performed on 17 
January 1999 at 19; 25 GMT. 

A]Dhre\ri ati on s : 

At Arabidopsis thaliana , Sc Saccharomyces cerevisiae , Sp 
Schizosaccharomyces pombe , Ce Caenorhabditis Glegans , Dm 
Drosophila melanogaster , Hs Homo sapiens, Mm Mus musculus , Rn 
Rattus norvegicus, Xm Xiphophorus inaculatus , 

LIST OF SEQUENCES 

SEQ ID NO: WSEQUENCE 

1 Landsberg erecta VRN2 cDNA 

2 Landsberg erecta VRN2 amino acid 
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3 Landsberg erecta VRN2 genomic 

4 Columbia VRN2 cDNA 

5 Columbia VRN2 amino acid 

6 Columbia VRN2 genomic 

5 7 5K (Columbia, abberant splice) cDNA 

8 5K (Columbia, abberant splice) amino acid 

9 C72616 EST (modified) cDNA 

10 C72616 EST (modified) amino acid 

11 AI163743 EST (modified) cDNA 

10 12 AI163743 EST (modified) amino acid 

13 At Hyp 2245035 (ATFCA7_4) (modified) cDNA 

14 At Hyp 2245035 (ATFCA7_4) (modified) amino acid 

15 KIAA0160 cDNA 

16 KIAA0160 amino acid 
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Additional sequences included in the Figures : 



17 Landsberg erecta VRN2 zinc finger amino acid 

18 At Dil9 S51478 zinc finger 1 amino acid 
20 19 At Dil9 S54178 zinc finger 2 amino acid 

20 At SUP U38946 zinc finger amino acid 

21 At Hyp 2191171 zinc finger amino acid 

22 At Hyp 3377806 zinc finger amino. acid 

23 Sc Pep7 91500 zinc finger amino acid 

25 24 Sc TFIIIA 730931 zinc finger amino acid 

25 Sp Hyp 1351713 zinc finger amino acid 

26 Ce Hyp 255942 zinc finger amino acid 

27 Ce Hyp 2854197 zinc finger amino acid 

28 Ce Hyp 304459 zinc finger amino acid 
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29 Dm BRCORE-NS-Z3 zinc finger amino acid 

30 Dm GAGA 729556 zinc finger amino acid 

31 Dm ken 3550814 zinc finger amino acid 

32 Hs ATBF-1 976347 zinc finger amino acid 
5 33 Hs KIAA0160 zinc finger amino acid 

34 Hs ZNF142 3123312 zinc finger amino acid 

35 Mm FOG 2252814 zinc finger amino acid 

36 Mm Spalt 1296845 zinc finger amino acid 

37 Rn Roaz 2149792 zinc finger amino acid 
10 38 Xm ZFl 532083 zinc finger amino acid 



EXAMPLE 1 

Characterisation and Cloning of VRN of Arabidopsis Thaliana 
and Mutant Alleles Thereof 

15 

Isolation of vrn2 mutants 



Two vrn2 mutant alleles (vria2-j and ■vrn2-2) v/ere isolated by 
mutagenising fca-1 seeds with EMS as described by Chandler et 
20 ai. (Plant J (1996) 10: 637-644). WO96/38560 (PCT/GB96/01332 ) 
discloses the sequence of fca and mutant alleles and their 
cloning and characterisation. The vrn2-I fca-1 line used here 
has been backcrossed to fca~l four times. For mapping 
purposes, the vrn2~l allele was used (at the 2nd backcross) . 

25 

Phenotvpic Characterization 



Vernaliza tion 
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The vernalization response of .rn2 mutant plants was 
investigated by examining their flowering ti.e in response to 
increasing durations of vernalization treatment. 

5 Standard vernalization oondrtions were used, i.e. low Ught 
intensity 5 ,„ol s-, , hr photoperiod, 5.1 degree C, for 
varying periods ,fro™ between 1 and 42 days) . Similar effects 
would be observed under continuous or no light, the 
temperature is more important. 
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in the absence of a vernalization treatment, vrn2-J fca-I 
mutant plants showed a small but consistent delay in flowering 
compared to the parental (wild-type, fca-I controls (Figure 
1). However, following vernalization, this difference was 
15 greatly magnified (Figure 1, . The response exhibited by vrn.- 
1 fca-1 plants was typically a 35% reduction in total leaf 
number after 6 weeks of vernalization, compared to 67% 
reduction or fca-1 controls. The delay in flowering, both 
with and Without a vernalization treatment, as measured by 
=0 increased leaf nu^er, was also observed if the days to flower 
I.e. the day at which the first floral bud was visible was 



used, 



25 



Red/Far-Red light perception 

The response to vernalization in Aratidopsis has been 
positively correlated with the response to different ratios of 
Red(R) to Par-red(PR) n,,,, ,,,,,,, ^^^^^^^ ^^^^ ^^^^^^^ 

strongly tend to respond strongly to conditions of low R • PR 
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(Bagnall, Ann Bot (1993) 71: 75-83; Martinez-Zapater et al . , 
Plant Physiol (1990) 92: 770-776). This response typically 
manifests itself in two distinct ways - an acceleration of 
flowering time (leading to an effect that mimics the effect of 
5 a vernalization treatment) and a reduction in leaf area (or 
shade avoidance) . This response is believed to have evolved 
to allow plants to adapt to the availability of light 
allowing individuals to seek light when in competition with 
their neighbors. 

10 

We examined the ability of vrn2-l fca-1 mutants to respond to 
conditions that mimic such an environment. 



Under these conditions, vrn2~l fca-1 plants showed a marked 
15 reduction in the shade avoidance response, with the mean area 
of the largest leaf decreasing by only 26%, compared to 74% 
for the fca-1 control (Figure 2A) . However, the v'rn2-l 
mutation does not appear to affect all aspects of the response 
to FR light, as vr-n2-I fca-1 plants showed a similar 
20 acceleration of flowering in response to supplementary FR 
light as fca-1 controls (Figure 2B) . 



These data provide indication that VRN2 plays a role in 
regulating the response to FR light, and may mediate changes 
25 specifically in leaf size (as flowering time is only slightly 
affected) under conditions of low R : FR ratios. 



Gene t i c Mapp inQ 
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The VRN2 gene was mapped in an F2 population derived from the 
cross \rrn2-l fca~l crossed to fca-10, following the procedure 
used to map the VRNl gene (Chandler et al . supra.). The VRN2 
gene was initially positioned using a population of 70 F2 
5 individuals between the RFLP markers gl3683 and mill2 (Schmidt 
et al.. Plant J (1996) 9: 755-765) on the long arm of 
Chromosome IV (or D) , using conventional techniques. The map 
position of VRN2 was further refined by screening an 
additional 429 F2 plants with a SSLP derived from the marker 

10 gl9247 (Schmidt et al . , (1996) Plant J 9: 755-765) and the 

CAPS marker g4539 (Parker et al . , Plant Cell (1998) 9: 1-17), 
A total of 12 individual F2 plants that were recombinant 
between gl9247 and g4539 were further analyzed with RFLP 
markers gl3683 and CC36F6 (Bancroft et al . , Weeds World 

15 4ii:(1997)) and the CAPS marker C18 (Parket et al , , supra.), 
and with two additional CAPS markers {VRN2RS, VRN2CD) 
generated using the published Columbia sequence (Bevan et al . , 
Nature (1998) 391: 485-488) 

(accession numbers Z97341 and Z97342) as a template. The VRN2 
20 gene was localized to a 245 kb region defined at the 

centromeric (north) end by an RFLP detected with the cosmid 
CC3 6F6, and at the telomeric (south) end by the g453 9 CAPS 
marker. This interval is defined by 3 recombinant individual 
F2 plants; 1 recombinant between VRN2 and CC36F6, and 2 
2 5 recombinants between g4 53 9 and VRN2 (Figure 3) . 

Physical Mapping 



The genetic interval defined by CC36F6 and g4539 is almost 



completely covered by the 3 BAG clones T1C7, I5D3 and T5015. 
Cosmids derived from subclones of YAC EW16B10 in the binary 
vector 04541 (Bancroft et al . , supra,) (abbreviated as IB) 
were positioned by end sequencing, and ordered relative to the 
published sequence of the Columbia ecotype in this region 
(Genbank accession numbers Z97341 and Z97342) . We selected 
cosmids clones that extended from the complex RPP5 locus 
outwards to CC36F6 and g4539, reasoning that VRN2 was not 
within the RPP5 locus, which is comprised of multiple repeats 
of RPP5-like genes in both Columbia and Landsberg ecotypes 
(Bevan et ai . , supra.). 

Additional Landsberg cosmids in the 04541 binary vector 
covering the region not covered by the Columbia YAC subclone 
cosmids were identified by hybridization to the inserts from 
BACs T1C7 and T5015, and aligned based on end sequencing, and 
compared to the published Columbia sequence (Bevan et al . , 
supra,) and to the sequence of the Landsberg ecotype in this 
region. An almost complete cosmid contig was generated over 
this region. 

Simultaneously with the isolation of cosmids, ordered cosmids, 
beginning with those at the centromeric end of the contig, 
were transformed into vrn2-l fca-1 plants by Agrobacterium 
tu/Tjefaciens-mediated vacuum infiltration (Bechtold et al . , C R 
Acad Sci Paris (1993) 316: 1194-1199). (Figure 3). The 
presence of the cosmid in each transgenic line (Tl plants) was 
confirmed by a cosmid- specif ic diagnostic PGR, comprising an 
insert specific primer (corresponding to a portion of the 



■\ ^ — " " Ik. dill, 



(..;,.■- 

62 



Columbia genomic DNA) and a primer nr... . • 

P^irner present in the cosmid 

vector. 



S^^^^i^^omnlementa^ 



5 



int.o..ce. i„.o ^.^.^ ^^^^^^ 

he.r ability to co™ple»»„e the p.enotype. X2 seeds 

.o™ i„.i.i,„., „ ^^^^^^^^^^^^ .a„a™.ci„ .es.stan=e at 

-1 ratio, .e.e so™ o„ soil ana ve„aU.ea .o. two ,i„ so.e 
e.pe.™e„ts tb.ee, „ee.3. Pl^ts t.en t.a„s.e.e. to 

.„e„.ouse oo..itions, a„. a.te. ten .a.s pric.e. ont into 

individual oompartments of divided trays T„^ , , 

trays. Total leaf number 
•as determined, and cosmids were sc„„,i 

" complementing if 

- segregation ratio of early to late plants ,.Ken oompared 
o .oa-l and vrn... controls, plants was approximately 



n) Clearly ocmplemented the phenotype of vrn.-i fea-1 

20 mutants, with the e^rlio^^ i 

^'^"^^ ing at approximately 

the same time as vernalized fca-1 plants. 
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Prediction 



- The sequence in oommon to both la«.3 and 1B.„1 oosmids Has 
previously been annotated as oontainrng . complete predicted 

genes (ATDL4445W and ATDL4450W) and < 

u-i^buw; and (presumably 

non-functional) portions of ^ , 

P rtzons of two other genes - the 3 ' end of 

ATDL4440W and the 3' end of the RPPs i,. 

cne RPP5-like gene, CHPR 
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{ATDL44 60W) (Bevan et al . , supra,) Genbank accession number 
Z97342. In addition, a cognate cDNA (5K) not included in the 
annotation is present in this region, and appears to span two 
of the predicted genes (ATDL4445W and ATDL4450W) (Bevan et 
5 al . , supra.). However, as two cosmids in the region (1 

independent Tl line from each of cosmids IB4N6 and IB6C5) did 
not complement the mutant phenotype (Figure 4) this ruled out 
the predicted gene ATDL4445W. This left the unannotated cDNA 
5K, and the predicted gene ATDL4450W as candidates for VRN2 . 
10 However, the presence the cognate cDNA 5K from the Columbia 
ecotype that overlapped both ATDL4445W and ATDL4450W 
necessitated a re-examination of the prediction for the 
ATDL4 4 5 0 W gene . 



15 In order to define the structure of these genes, we used the 
NetGene2 prediction program (Hebsgaard et al . , Nucl Acids Res 
(1998) 24: 3439-3452), using " Arabidopsi s " as the organism 
option (the only parameter that can be set manually). BLAST, 
PSI -BLAST, PSORT and PROSITE programs were used to identify 

20 potential function domains and similarities (Altschul et al . , 
Nucleic Acids Res (1997) 25: 3389-3402; Bairoch et al . , Nucl 
Acids Res (1997) 25: 217-221; Nakai et al . , Genomics (1992) 
14: 897-911). Default parameters of TBLASTN, PSI-BLAST and 
BLASTP were used (Expect = 10, BLOSUM62 matrix, gap penalty = 

25 11, penalty per gap 1, lambda ratio 0.85). The NCBI/GenBank 
database was used. The PSORT algorithm (Nakai) was used, 
using the "plant" option as the source organism (the only 
parameter that can be manually changed) . The Profile Scan 
program at PROSITE (Bairoch) was used to search for motifs in 



64 

VRN2, with default parameters (there are no parameters a user 
select, the results being -hit- or -no hit-). 



can se 



This analysis yielded predictions for two genes, 5K, a 
5 putative nuclear localized protein that is highly 

post-transcriptionally spliced (15 exons) , represented by the 
Columbia cognate cDNA; and a modified prediction for 4450, 
with 6 putatxve membrane -spanning domains, represented at its 
3' end by an Arabidopsis EST (accession number T22412) . 

10 

Determination nf the vr-nP 7 m,,i- = k' 

^^^^-^ 2-1 Mutation and TdPnf H f jcation nf i-u^ 

VRN2 gene 

In an attenpt to determine which gene (5K or 4,50) is VEN2 
15 PCR primers were designed to amplify products encompassing the 
entire predicted open reading frame of both genes. 

Three independent RT-PCR reactions using total RNA prepared 
from fca.l,vr«-l fca-1 and .rn2-2 fc.-l M day-old seedlings 
20 grown on 3« plates in continuous light were performed for each 
predicted gene with a high fidelity enzyme mix (Boehringer 
Mannheim, HiFi System, . These PCR products were sequenced 
.s.ng both the primers used for PCR, and a series of internal 
primers, using the BIGDYE kit (PE Applied Biosystems, The 
2S reactions were run on an „I3,7 machine, and compiled using 
the SeqMan (Drastar, Lasergene) program. 

The sequences o£ the PCR confirmed our prediction for both 
9enes, and indicated that we had amplified across the entire 
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10 



15 



20 



open reading frame of 5K and ATDL4450W as anticipated. 

Several minor polymorphic differences were detected between 
the published Columbia sequence, and the Landsberg erecta 
sequence we amplified by PGR. These differences were 
consistent with the Landsberg erecta genomic sequence in this 
region. Furthermore, the Columbia cDNA for 5K appears to 
utilise a different splice donor site from that used in the 
Landsberg ecotype , and would produce a truncated, presumably 
non- functional protein (Figure 5) . However, we have also 
sequenced the Columbia 5K product derived independently by 
RT-PCR, and this appears to use the same splice site as 
Landsberg, and should encode a functional protein. A 
consistent difference between the vrn2 mutants and fca-1 was 
detected in the 5K PCR product, a G to A change at position 
1201 of the predicted cDNA in vrn2-l fca-1 (Figure 5) . We are 
currently investigating the nature of the mutation in the 
vrn2'2 allele. This type of mutation, a single base-paj.r 
change, is commonly observed following EMS mutagenesis. This 
mutation converts a TGG codon (Tryptophan) to stop codon 
(TGA) , and would result in the production of a truncated 
protein of 322 amino acids in the vrn2-l mutant, compared to 
443 amino acids of wild type VRN2 (Figure 6) . The presence 
of this mutation indicated that 5K was likely to be VRN2 . The 
presence of the vrn2-l mutation in the genome of vrn2-l jfca-I 
mutant plants was confirmed by a derived CAPS (dCAPS) 
(Michaels et ai . , Plant J (1998) 14: 381-385; Neff et al . , 
Plant J (1998) 14: 387-392) marker specific for the V2rn2-1 
mutation (Figure 7) . This diagnostic test is specific for the 
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vrn2~l mutation, as it detects wild type VRN2 in both fca-1 
and vrn2-2 fca-1 mutants. 



Analysis of the VRN2 gene 

5 

To gain an insight into the possible function of the VRN2 
gene, and how the vrn2'l mutation may affect the function of 
the VRN2 protein, we compared the amino acid sequence of VRN2 
to several databases of protein and translated nucleic acid 
10 sequences using BLASTP and TBLASTN programs at NCBI, using the 
default parameters as noted above. 



Several molecules with a significant degree of similarity were 
identified (Table 1 and Table 2) . 

15 

One such gene, represented by a human cDNA (KIAA0160) shares 
homology with VRN2 over a short region near the amino terminus 
of VRN2 (amino acids 63 to 132), and a longer, but less 
conserved region of homology towards the carboxy terminus 

20 (amino acids 263-366) (Figure 8a) . Closer examination of the 
amino terminal conserved region revealed that it matches the 
consensus of a zinc-finger motif. Such motifs can take a 
variety of forms, but all co-ordinate zinc atoms through two 
cysteine residues, and two cysteine or histidine residues. 

25 VRJSf2 falls into the latter class, having a C2H2 motif 

comprised of two cysteines separated by 2 amino acids, and two 
histidines separated by two amino acids (Mackay et al . , TIBS 
(1998) 23: 1-4) . 
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Zinc-finger motifs are known to be capable of mediatinq both 

"'7. 

protem-DNA and protein-pi Vjn interactions. The zinc finger 
motif of VRN2 does not closely resemble the large EPF family 
of Arabidopsis C2H2 zinc fingers from Arahidopsis , which have 
5 a highly conserved QALGG motif in the middle of the zinc 

finger (Kubo et al . , Nucl Acids Res (1998) 26: 608-615) (Figure 
8b) . In addition, VRN2 differs from the EPF proteins in that 
VRN2 has a single zinc finger motif, whereas most members of 
the EPF family (with the exception of SUP and AtZFPl) have 
10 between two and four zinc fingers (Kubo et al . , supra.). This 
amino-terminal region (amino acids 63-132) , and particularly 
the zinc-finger motif (amino acids 90-111) may thus represent 
a domain that mediates protein-protein or protein-DNA 
interactions . 

15 

The carboxy terminus of VRN2 (amino acids 263 to 366) is 
similar to several other candidate genes (Table 1 and Table 
2) . As mentioned above, there is limited homology to the 
human predicted protein KIAA0160. The molecule showing 

2 0 greatest homology to VRN2 is an EST sequence from poplar 

{Populus tjremula L. x Populus tremuloides Michx (Accession 
Number AI163743) (Sterky et ai . , PNAS USA (1998) 95: 
13330-13335) which has 52.8% identity over 127 amino acids 
(Table 1) , as calculated with the BLASTP algorithm using 

25 default parameters (as noted above) . VRN2 also shows 

significant similarity to a predicted Arahldopsis protein 
(ATFCA7_4, Accession Number 2245035) (Bevan et ai . , supra,), 
which is quite close to VRJSr2 on chromosome 4, only 3 0 kb away 
towards the centromere. A close examination of the sequence 
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near this gene revealed that the prediction as annotated may 
be incorrect, as the use of a different splice site, resulting 
a different carboxy terminus to the protein, increases the 
degree of homology with VRN2 . The similarity of these two 
5 Arahidopsis genes raises the possibility that VRN2 may be a 
member of a gene family in Arahidopsis , and their close 
position suggests that these genes may have arisen following a 
duplication. Arguing against this notion is the observation 
that these two genes, VRN2 and ATFCA7_4 , are transcribed in 
10 opposite directions. A rice EST (C72616) also shares 

significant similarity with the carboxyl region of VRN2 , 
suggesting that this region may form an evolut ionarily 
conserved domain present in monocots and dicots . 

15 This conserved carboxy region is predicted to be highly 
charged, as it is composed of a large number of acidic 
residues (D and E) , This highly charged region is highly 
similar in the Poplar and rice ESTs (Table 2) . Such acidic 
regions are found in a numi^er of eukaryotic transcription 

20 factors, and often function as activation domains (Hahn, Cell 
(1993) 72: 481-483). It is therefore possible that VRN2 may 
function as a transcription factor, given it has both a 
DNA-binding motif (or protein binding) (amino acids 63-132) 
and a putative activation domain (amino acids 263-328) . 

25 Furthermore, the amino portion of VRN2 contains two predicted 
nuclear localization signals (NLSs) (Figure 6) . The first is a 
simple 4 residue basic signal, while the second is a 
bi-partite signal, that fits the consensus (R/K) (R/K) NIO (R/K) 4 
(Dingwall at ai . , TIBS (1991) 16: 478-481). 
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EXAMPLE 2 

Production and Characterisation of Arabidopsis Transgenic for 
VRN2 

5 VRN2 cDNA in the sense orientation is cloned into plant 

expression vectors SLJ4D4 and SLJ4K1 (Jones et al . , (1992) 
Transg. Res. 1, pp 285-297) according to the teaching of 
Sambrook J et al (1989) Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Cold Spring 

0 Harbor, NY. SLJ4D4 and SLJ4K1 place the VRN2 cDNA under the 
control of the Ca MV 3 5S promoter and include terminator 
sequences from octopine synthase (ocs) and nopaline synthase 
(nos) genes, respectively. 

5 Antisense constructs are produced in the same manner except 
that the VRN2 cDNA is inserted into the expression vectors in 
the opposite orientation. 

The VRN2 expression cassettes are then subcloned separately 
0 into the binary vector SLJ1714 (Jones et ai . , supra), and 
mobilised into Agrobacterium strains by tri-parental mating 
according to the teaching of Hoekema et al , , (1983) Nature 
303, pp 179-180. Arabidopsis are transformed with the 
Agrobacterium strains carrying VRN2 expression constructs 
5 (either sense or antisense) following the teaching of Bechtold 
et ai., (1993) C R Acad. Sci . Paris 316, pp 1194-1199. 



Arabidopsis plants are assayed for changes in their response 
to changes in the ratio of far red light, essentially as 
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described by Halliday et ai . , (1997) Plant J. 12, pp 1079- 



1090. 



Resul ts 



5 



Differences in vernalisation requirement and response are 
observed in Aratidopsis plants transgenic for VRN2 (sense or 
antisense orientation) relative to Arabidopsis plants 
transformed with empty vectors and ncn- transformed Arabidopsis 
10 plants. 

EXAMPLE 3 

Production and Characterisation of Tobacco Transgenic for VEN2 

15 VRN2 cDNA in the sense orientation is cloned into plant 

expression vectors SLJ4D4 and SLJ4K1 as described in Exaniple 
2 . 

Antisense constructs are produced in the same manner except 
20 that the VRN2 cDNA is inserted into the expression vectors in 

the opposite orientation. 

VRN2 expression cassettes subcloned separately into the binary 
vector SLJ1714 (Jones et al supra) , are mobilised into 
25 Agrotacterium strains by tri-parental mating as in Example 2, 
and tobacco plants are transformed using the Agrobacteriu^n 
strains carrying VRN2 expression constructs (either sense or 
antisense) following the teaching of Horsch et al . , (1985) 
Science 227, pp 1229-1231. 




Tobacco plants are assayed for changes in their response to 
changes in the ratio of far red light, essentially as 
described by Halliday et al . , (1997) Plant J. 12, pp 1079- 
1090 . 

5 

Resul ts 

Differences in vernalisation requirement and response are 
observed in Tobacco plants transgenic for VRN2 (sense or 
10 antisense orientation) relative to Tobacco plants transformed 
with empty vectors and non- transformed Tobacco plants. 

EXAMPLE 4 

Production and Characterisation of Brassica (Oil Seed Rape, 
15 Winter Type) Transgenic for VRN2 

Sense and antisense constructs are used to generate 
Agrobacterium strains carrying VRN2 expression constructs 
(either sense or antisense) as described in Example 2 and 
20 Example 3, and are used to transform oil seed rape following 
the teaching of Moloney et ai . , (1989) Plant Cell Rep. 8, pp 
238-242 . 



Resul ts 

25 

Differences in vernalisation requirement and response are 
observed in oil seed rape plants transgenic for VRN2 (sense or 
antisense orientation) relative to oil seed rape plants 
transformed with empty vectors and non- transformed oil seed 
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rape plants. 
EXAMPLE 5 

Production and Characterisation of Rice Transgenic for VRN2 

5 

VRN2 cDNA in the sense or antisense orientation is cloned into 
constructs and used to generate respective Agrobacterium 
strains. The Agrojbacteriam strains carrying VRN2 expression 
constructs (either sense or antisense) are used to transform 
10 rice following the teaching of Kohll A et al (1998) Proc . 
Natl. Acad. Sci . USA 95, pp 7203-7208. 



15 Differences in vernalisation requirement and response are 
observed in rice plants transgenic for VRN2 (sense or 
antisense orientation) relative to rice plants transformed 

with empty vectors and non- transformed rice plants. 

2 0 EXAMPLE 6 

Production and Characterisation of Wheat Transgenic for VRN2 

Agrobacterium strains carrying VRN2 expression constructs 
(either sense or antisense) are generated as in preceding 
2 5 Examples and are used to transform wheat following the 

teaching of Becker D et al , , (1994) Plant J. 5, pp 299-307. 



Resul ts 



Results 
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Differences in vernalisation requirement and response are 
.observed in wheat plants transgenic for VRN2 (sense or 
antisense orientation) relative to wheat plants transformed' 
with empty vectors and non- transformed wheat plants. 

METHODS AND MATERIALS 



Plant Growth 

For vernalization treatments, seeds were sown on a damp layer 
10 of fine grit (Levington's M3 ) on wet soil in individual pots, 
and vernalized for increasing durations at 4^C, 8hr light :16hr 
dark, 5 ^mol m"^ sec'^ light intensity. Seed sowing was 
staggered, with all plants removed from the vernalization 
conditions simultaneously. Following vernalization, plants 
15 were placed into a controlled environment chamber 

(Gallenkamp) , 20°C, 16 hr light: 8hr dark 90 ^mol m'^ sec"^ 
light intensity. Plants receiving no vernalization treatment 
were stratified for 2 days under vernalization conditions, and 
grown for two days prior to transfer into the growth cabinet. 
0 Plants were grown for 10 days, and then pricked out into 

individual compartments of P40 trays. The flowering time, as 
measured by counting total leaf number (i.e. rosette and 
cauline leaves) was determined once the primary inflorescence 
had elongated sufficiently. 

5 

The phenotype of vrn2-I fca-1 plants was examined under 
different ratios of red to far-red (R:FR) light. Plants were 
stratified for 2 days, then grown for 10 days under continuous 
light, and then transferred to separate growth chambers with 



white light (W) , R:FR ratio of 5.8, with or without 
supplementary FR light (W+FR) , R:FR ratio 0.08. The number of 
rosette leaves was determined to measure flowering time, and 
the area of the largest leaf was used as an indicator of the 
shade -avoidance response. 



Mapping 

VRN2 was positioned on the long arm of Chromosome 4 (or D) 
through linkage to the RFLP marker m506 (Chang at al . , Proc 
Natl Acad Sci USA (1988) 85: 6856-6860), in progeny of a cross 
between vrn2-l fca-1 {Ler background) and fca-10 (Ws 
background) . Further RFLP markers in this region (Liu at al . , 
Plant J (1996) 10:733-736) were used to refine the position of 
VRN2. We used standard RFLP techniques, using 32P-labeled 
cosmid probes, and a Phosphorlmager detection system. A 
Ler:Ws RFLP was detected when using EcoRI digested genomic 
DNA, and either gl3683 or CC36F6 as probes. The gl9247 marker 
is a SSLP marker (using the primers gl9247F and gl9247R, see 
below) , with Ler producing a PCR-amplif led band of 
approximately 750 bp, while Ws produces a band of 862 bp. 
Fine mapping of VRN2 was performed using a series of 
PCR-derived markers, based on the Columbia genomic sequence in 
this region (Bevan at al., supra.). The VRN2CD marker was 
amplified with two primers, VRN2-C and VRN2-D, and digested 
with Ddel. This produces a CAPS marker, with Ler producing 
bands of (approximately) 480 bp, 290 bp and 190 bp, while Ws 
produces bands of (approximately) 330 bp, 290 bp and 150 bp. 
The VRN2RS marker is a dominant marker for Ws (i.e. Ler:Ws 
heterozygotes cannot be distinguished from Ws) , produced by 



amplifying genomic DNA with VRN2-R and VRN2-S (see below) , and 
digesting the product with MhoII. This produces a single 
predominant band (and several much smaller unresolved bands) 
in Ler of approximately 4 00 bp, while Ws produces two bands, 
4 00 bp and 3 00 bp. 

Cosmid Isolation 

Cosmids covering the region not covered by the subclones of 
EW16B10 YAC were identified through hybridization to BAG 
inserts derived from BACs T5015 and T1C7 . BAG DNA was 
purified and the insert isolated following digestion with NotI 
and separation by pulsed field gel electrophoresis (PFGE) as 
described in (Bancroft et al , , supra.). Purified BAG inserts 
were labeled with a32P-dCTP by random primed labeling, and 
hybridized to arrayed grids of a Ler genomic cosmid library. 
Positively hybridizing cosmids were identified, and rescreened 
following restriction enzyme digestion, southern blotting and 
hybridized to the BAG insert probe used initially. Gosmids in 
the region of interest were selected, and genomic sequence was 
obtained from the ends of the insert using the BIGDYE cycle 
sequencing kit, and T3 and T7 primers, whose sequences flank 
the genomic DNA insert site. This sequence was aligned with 
that of the Columbia genomic sequence to accurately position 
the cosmids . 

Complement at ion 

Gosmids in the 04541 binary vector were mobilized into 
Agrobacterium tumefaciens (strain C58G1 RifR by tri-parental 
mating (Hoekema et al . , Nature (1983) 303: 179-180). V2rn2-1 
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fca~l plants were transformed with these Agrobacterium strains 
by vacuum infiltration (Bechtold et al, supra,). Transgenic 
Tl plants were selected on GM with Kanamycin (50 mg/mL) , and 
transferred to soil when they had reached the 3-4 leaf stage. 
5 The presence of each cosmid in the transgenic lines was 

confirmed using either a Ler/Col specific polymorphism (CAPS 
or SSLP marker) or more commonly, through the use of a 
specific diagnostic PGR reaction, using a primer present 
within the cosmid insert sequence and a primer present in the 

10 cosmid flanking the insert site. Transgenic plants were also 
tested for the presence of the fca-1 mutation , as the TO 
plants should carry this mutation. Together, these two 
methods ensured that only vrn2-l fca-1 plants carrying the 
desired cosmids were analyzed further. We aimed to produced 5 

15 to 6 independent Tl transf ormants for each cosmid, but for 
some cosmids only a single line was produced. However, as 
complementation was observed before we had generated Tls for 
all of the cosmids, we are continuing to produce additional 
transgenic Tl plants carrying the cosmids near VRN2 , Tl 

2 0 plants were grown in a controlled environment chamber 

(conditions as above), and allowed to self. T2 seed were 
collected, and analyzed for the segregation of Kanamycin 
resistance or sensitivity on GM plates containing Kanamycin 
(as above), scored 14-20 days after germination. Progeny from 

25 Tl plants that segregated a 3:1 ratio of resistant to 

sensitive plants were tested for their ability to complement 
the vrn2-I mutant phenotype, by vernalizing for 3 weeks and 
recording the total leaf number. 



Sequencing ORFs 

Two potential open reading frames were sequenced following 
RT-PCR using RNA isolated from fca-1 and vrn2-l fca-1 plants. 
cDNAs were reverse transcribed, primed by a dT12-18 primer, 
and specific primers were then used to PGR amplify regions 
corresponding to the ORFs for ATDL4450W and 5K (VRN2) , The 
Boehringer Mannheim HiFi PGR system was used to increase the 
fidelity of amplification. The primers VRN2-AL and VRN2-AM 
were used to prime ATDL4450W, and VRN2-AI and VRN2 -AJ for 5K. 
For 5K, the PGR reaction for the full length product 
(VRN2A1-VRN2AJ) was inefficient, so subsequent reactions were 
performed to amplify the cDNA in two overlapping fragment, 
using the primer combinations VRN2-AI with VRN2 - AS ; and 
VRN2-A0 with VRN2-AJ. The PGR products were isolated and 
purified, and sequenced directly using the BIGDYE sequencing 
kit (PE Applied Biosystems) . At least two independent PGR 
products were sequenced from each allele. We sequenced the 
ATDL4450W PGR products with the amplifying primers VRN2 -AL 
VRN2-AM, and VRN2 -AU through VRN2-AX. The 5K cDNA was 
sequenced with VRN2-A1, VRN2-AvJ and VRN2 -AO through VRN2-AT, 
Sequences were aligned into contigs using the DNAStar software 
package (LaserGene) . 



Sequence Gomparisons 

EST and cDNA sequences were first translated using the MapDraw 
program of DNAStar (LaserGene) . This revealed several regions 
where the sequence appeared to be incorrect, as small changes 
to the described sequences drastically improved the similarity 
to VRJSr2 . Modified nucleic and amino acid sequences were 
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initially aligned using programs within the MegAlign Package 
of DNAStar (LaserGene) , as follows: 



Amino acid sequences were initially aligned using the Clustal 
5 V method (Higgins and Sharp (1989) CABIOS vol. 5, no . 2 , 151- 
153)- Default parameters were used: Gap penalty 10, Gap 
length penalty 10, ktuple 2, and the PAM 250 residue weight 
table. Alignment was refined by eye and hand, making minor 
adjustments to position of gaps to improve alignment. 

10 

Nucleic acid (EST and cDNA) sequences were initially aligned 
with the Hein method (Hein (1990) Methods in Enzymology, vol. 
183, 626-645), using default parameters: Gap penalty 11, Gap 
length penalty 3, ktuple 2, and the weighted residue weight 
15 table. Where the nucleotide alignment (when translated) did 
not correspond to the amino acid alignment, the positions of 
gaps were adjusted to correspond with the gaps in the amino 
acid alignments. 



2 0 dCAPS Marker for the vrn2-l Mutation 

A derived CAPS (dCAPS ) marker was designed that was specific 
for the vrn2~l mutation. Following PGR amplification from 
genomic DNA with the primers VRN2-AY and VRN2-AZ, the 170 bp 
product was digested overnight with XmnI restriction enzyme, 

25 and the products resolved on a 4% agarose gel. Wild type 
plants (VRN2) produce a single band of 170 bp following 
digestion, while vrn2-l mutants produce two bands, of 137 bp 
and 3 3 bp . 
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Primers Used to Identify VRN2 

As indicated on Figure 3 and Figure 5. All primer sequences 
are indicated 5' to 3 ' . 
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SEQ ID NO: 1 



CAAGCTTCTTCAATTTTGCTTGCTCTCTCTTACACAGCCAATCGGTGTTTTCGCAGCTTTCA 
GGCCTCAATCCAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGCAT 
5 T^AGTTTATCGCTATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACTAA 
CTTCATAATCGGCTCTTGACGTTGTTGAGTGTAATTGAACAAGAATGTGT^^^^^ 

CGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGTAA 

ACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACCCATCGTTTCTTCCAA 

GATGCTTGAACTACAAAATTGGAGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGAT^^^^ 

) ^TTTTCAACTATAAGGATTGTAATAACACATTACAGAAAACTGAAGTTAGGGAGGATTGTTC 

TTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGGGCTGCAATTTCATTTGAATTCAT 

CTCATGATTTATTTGAATTTGAGTTCAAGCTTTTCGAAGAATACCAGACAGTTAATGTTTCT 

GTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAGTGATGACGATAAATTTGAGCCCTT 

CTCTCTCTGCTCGAAACCTCGTAAGCGGAGACAAAGAGGTGGCAGAAATAACACCAGGAGAC 

TTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTAACTAATGGCACAGAAAATGGAATC 

ACCCTACTTAATGATGGAAACCGTGGTTTAGGATATCCCGAGGCAACAGAGCTTGCTGGACA 

ATTTGAGATGACCAGCAACATTCCACCAGCCATAGCCCACTCTTCTCTGGACGCTGGTGCTA 

AAGTTATATTGACAAGCGAAGCTGTGGTCCCTGCTACTAAGACAAGAAAGTTATCTGCTGAG 

CGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACGCCAATTCTATCATTCTCACAGAGT 

CCAGCCAATGGCGCTTGAGCAAGTAATGTCTGACCGGGATAGCGAGGATGAAGTCGATGACG 
ATGTTGCAOAT^^^^^^ 

AAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGGTTATAGCAGAT^^^^ 

TATCTCTTGGGCATGTGAAGCATTTTCAAGATTTTACGAGAAAGAGTTGCACCGTTACTCAT 

CACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCATGGACTTGTCGACTCA 

GCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATAGCTCAGACACCACCAC 

CACCAACAACAACAACAGTGTGGATCGTCCCAGTGACTCAAAGACCAACAACAATAAGATTG 

TGGATCATCCCAATGACATAAACAACAAGAACAATGTTGACAACAAGGACAATAACAG^ 

GACAAAGTAATTAAATAGGAAAATCTCCGGCTTTTATGATACCGATTTATCGGATTGTAACT 

TATTCTTCTTTCTTAAAAAATTGTTTAGGAGCAAACAAATTTTTTATATGTTAGTGTATTCA 
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ACTGATTACATTTTTAGTTAAAAAAAAAAATGGATTCTGCTTATAACT 
SEQ ID NO: 2 

MCRQNCRAKS S PEEVI STDENLL I YCKPVRLYNI FHLRSLGNPS FLPRCLNYKI GAKRKRKS 

RSTGMWFNYKDCNNTLQKTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKLFEEY 

QTVNVSVKLNSFIFEEEGSDDDKFEPFSLCSKPRKRRQRGGRNNTRRLKVCFLPLDSPSLTN 

GTENGITLLNDGNRGLGYPEATELoAGQFEMTSNIPPAIAHSSLDAGAKVILTSEAWPATKT 

RKLSAERSEARSHLLLQKRQFYHSHRVQPMALEQVMSDRDSEDEVDDDVADFEDRQMLDDFV 

DVNKDEKQFMHLWNSFVRKQRVIADGHISWACEAFSRFYEKELHRYSSLFWCWRLFLIKLWN 

HGLVDSATimCNTILENCRNSSDTTTTNNNNSVDRPSDSNTNNNNIV^ 

KDNNSRDKVIK 

SEQ ID NO: 3 

AAAGAGAATGCTTTGACTCTCTCATTGGTCAAACCTGACTGTATTTATATGCGTTATTGTGT 

GGTAAAGTTTCGACCTTTGACTTTACAAGTTGGCGTTAAGAAGAGAGATGCGTAGATCAGCG 

AGTGGTTCGAGAGTTTTGGATCATTTTCCCCCGACTTCACGGTCTCCACGTCGATCTCAGAG 

CATTACATCATTGGAAGATGATGTGGAGGTGCTTTTGCCTAGGTACGATCCGAATTCTCAAG 
CGGGGAAGAGAGAGJ^ 

ATTCTTCTTCTCTGTATCGCAATCCTCTGGCTCTCCTCTTATTCAGGTAAGCCGAGAAATTG 
ATTCAATCTCTATGAATCCATAATTGATATGTGAAACTTAATTAGGGATTTTACAAAGGCTC 
ATATGGATATGATATGAGGATCGAGATGTCTCTGTAACATTAGAATCTTGTGTTGAATTATT 
GTTTCAATTTGTTCATATTATACTAAACCGGTGATGGATTTGGAATTTGTCAGCAGCGTTAA 
GGAGTTGAGTTCAAGAAGCAACATGTTGTCTTGTCTCCATGGGAACTCATCATATTCAGTTT 
TGGGAAAGGAAACAATTTTTTTTACCGCCGGTGATTATGTGCCGCAAACCATACGTAACTTT 
TGTAATTTTCGGTTCTGTAGACACATAAAAGGATCTCTCGTTTTCATGAAATGTATGTTTAA 
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TATTTCACTATACATCACACAACTCAAGTAGAAAACACTGATGGTTATCCATTAATCATCAT 
TCTATTGGTCGAAAACAAGGATTAGTTTCAACTTATTGCTACCTTAGTGATTAGATGTTCCT 
GTGAGTTTCAGCTAGCCAAGTCAACTAGAGTTAAACAATGGAATCAAAATACATATTCAGTA 
ATTTATTTTAAACTCTGACTATTTATGTAAACAAAAATGGAAATTAAAATTGAAGGTCATGA 
AGATTCTATTCTTAGTATGAAAAGTATAGATCAATGATAAAAGTATATACCAGAACAGTGGT 
GGATCTAGAAACATATTTAGTATATGGCACAATATATTTAACATATACAAATTTTAATCTAA 
AAGTTGTATTCATTTATGAAAAGACWTCTGAATGAAGCAAATTTATTTGATGTGTTAATCAT 
CCATTTATGTGTTAATCAGCCATTGATGTTAGTATAGTACTCTATGCTAACATAATTTTTTT 
ATACTATAAATTAAAAAAATAGGTAAGAAAAGAAAAATAGATTAATATAAAAAGCATTTTAT 
TAGCTGAAATAAATAAAATGAAAGAAGATAATAACTAATTGACTAAAAAATTAGTAGAGCAT 
ATGGGGCACAATACACTAAGTATTTCATCTTTACTATAAAATGTAACAAATTTCAAAATTAT 
CAAACTGTATATAGGGCACGTGCCTAGGTACCAATAGACGTACGTCCGCCCTGAAATAAGTT 
GGTGAATATGGTTTTAATTCCTCTAATACTCACTGTACTGCCATGGTAGAGGTGAAAAAAAC 
AATTTTAGAAATATTATAATGGATTAAGCTGTCCAAGTTGGTCGTATTTTCTTTACATTTTA 
TTAACTAATAAACATAAATAAGTTCAACTATTTATTGACTAGTAATAATACGTGTAAAATGT 
CTATTGGTTTAAAATATGGGCCATAAGGCCCAGACTTGAAAAAAAAACTTGAAACCCAAAGT 
TATATTTTTACTTGTTTCTTCTTTCTCAGTGAATATCTCCCAATCAAGCTTCTTCGATTTTG 
CTCTCTCTTACACAGCCAATCGGTGTTTTCGCAGCTTTCAGGTTTGTCTCAATCTCAAATTA 
AATCGGAGTCAAGTAATAACAATTGATAAACCTAATTGTTTCCATTGTATTGTAAGATTTGA 
AATTTTGCTGTAGATCCGGAATCGAATTCTAGTTCTGGAATCGTTGATCTCGATGGAATTTT 
TTTTTTAAGATTTCTTCTTACACATTTGGTTCAAAAGATCACATAGTTTTATTTTAATTTGA 
TAAGTATGATGATTCTGCTAAGTGGCATTGGATAAAGTTTTCGTTTTTGCAATACGTCTAAA 
CTTGTCTATGTCTTGAATGAACTCTCTGAGTTGCTTAAAAAGTCTTGTGCTTTCTTTATTAC 
ACAGGCCTCAATCCAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTG 
CATTGAGTTTATCGCTATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCAC 
TAACTTCATAATCGGCTCTTGACGTTGTTGAGTGTAATTGAACAAGAATGTGTAGGCAGAAT 
TGTCGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTG 
TAAACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACGTATGATTTGCCT 
TCCTCTCTCATCATATTAGCTCAGTAATCTTTCATCTCCTGTGTAGATCACCCACTAATAGT 
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TTGAGTTTGCTAAGCTGATTATGGTCTGATTCATGGCGAGTGTGTGCTTCTTTTGTCTCCTA 

AATTTGAACTTGTTGTTTGTTGTTGCAGCCATCGTTTCTTCCAAGATGCTTGAACTACAAAA 

TTGGAGCAAAGCGCAAAAGAAAGTATGTTTTCTTCTTGAATGTAGCTGCTACAGTGATATGT 

TATTTATCTTACTTCTAATATGGAAGCTGATGACCTATTTTATCTTTGTTGAGTAGATATGG 

ACATAATGAATGGTTTCTTCTTTGTTCATGCTATAAACTTACATTTTATAAAATTGTGTTTT 

GGTTAGGTCAAGATCTACTGGGATGGTAGTTTTCAACTATAAGGATTGTAATAACACATTAC 

AGAAAACTGAAGGTTAGTCTTTTTCTGTTCGTCGACAAAATTCGATGTCAATGTCTATGTTT 

CTCTAGATGATTTGTTATTTACTATTTTTTTCTGTATTGTCATGCAGTTAGGGAGGATTGTT 

CTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGTGGGCAACTATTACAACTGAGGT 

TTCTTCCGGGGCCTTTCATATCTAACACTGTGAAATGCTACTGCTGTTTCATGCTGTATACT 

TTCACTGTTTGGTTACATATTTTTGTGTTTGTTGTTTGTCTTCTCACTCTTTTCGAACTGCT 

GAGTGTGTGCTTATCTGAGAAAACATGTCCCAGATGGAGCTTACAACCAATTGTCTTGTGTC 

TATGCAGGGGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAAGG 

TATGTGGTTTTATGGAAATTCTTGATTTGCTATGCCTTTATTAATGAGGTTATAGTTAAAAA 

AGGGTCTTTCCTATTGTAGCTTTCGGAAGAATACCAGACAGTTAATGTTTCTGTAAAACTTA 

ATTCCTTCATATTTGAGGTCAGTTACTTTAAACTTGGTTAATTGGGAAATCCGATAGCTGGT 

GAAAATTTTGTTTATATTCCATCCTTATTTGTACTAGGAAGAAGGAAGTGATGACGATAAAT 

TTGAGCCCTTCTCTCTCTGGTAACCCTCAGAACCCCTTCGATTAAATACCTTAATAGCAGTA 

ACTCCTTGCTTCTCTTGTCAGTACATCTCTGTAAATCCAACCATAATGTTTTGCAGCTCGAA 

ACCTCGTAAGCGGAGACAAAGAGGTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTTTT 

TACCGTTGGATTCACCCAGTTTAACTAATGGCACAGAAAATGGAATCACCCTACTTAATGAT 

GGTAAAATCATATCTTCTTCTGTGCGTTCCTTGTGGCTTAGAACTTCATATTACAGAAGAAG 

ATACAATGGCCTGATTGTTTAGTTTTTGTACTTCTCCTCGCATTCTTCTTGCGAGGGTATTG 

TTACCAGAACTGATGTACAAAATTAATGGCATGCTACAGGAAACCGTGGTTTAGGATATCCC 

GAGGCAACAGARCTTGCTGGACAATTTGAGATGACCAGCAACATTCCACCAGCCATAGCCCA 

CTCTTCTCTGGACGCTGGTGCTAAAGTTATATTGACAAGCGAAGCTGTGGTCCCTGCTACTA 

AGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGGTTTGTTCATCATGACACCCCG 

TCATCATAATTACCATTCCTGTTGTTACAAATGTTCTTCCTATTATGGATAAGTGTTTATAG 

TACTGCCATATTAACCGAGAAAATTTCTTCCAGCCACCTACTTCTTCAGAAACGCCAATTCT 



86 

ATCATTCTCACAGAGTCCAGGTGATCCAAGTTCCTTCACCTACTTCTTAGGCATTTTCTTTA 
AATTGCTCATGATGATATCTTATCAAAGCATACTTGGTTTGTTCTCATCCAAATTTGTATTT 
TGATCTGTATGTATCAACGCAAAATAGTTATGTCCATGTTGTCTCCGTTTTATTGCCACTAA 
CCAAAAAATGCATGTTTCTGTGACAAGCCAATGGCGCTTGAGCAAGTAATGTCTGACCGGGA 
TAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGGTATTCCATGATT 
TCTTTCTGCGTTCATTAAATAGACAACAGAAAATGGTATATGATGTAACTTGCTAATGGCTT 
TTGAAACTTAAAAAAGCTGCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATGAAAAGC 
AATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGTAACTACTTCTCTTACACATG 
AACAGACACAAAAAGACCTTATGTCTTACATTCCATACCTGTCTAAATGATTTTGCTTATGG 
AACTTTGAGCTCAATTATGATTGTTGATGTTTCAGGGTTATAGCAGATGGTCATATTTCTTG 
GGCATGTGAAGCATTTTCAAGATTTTACGAGAAAGAGTTGCACCGTTACTCATCACTCTTCT 
GGTAATATAAGTACACCAAACATATACAGACACATAACTACACTATCAATCTTGTTTCGTTT 
TCTGAAAAAAAAATAAAAATTTCCAGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCA 
TGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATA 
GCTCAGACACCACCACCACCAACAACAACAACAGTGTGGATCGTCCCAGTGACTCAAACACC 
AACAACAATAACATTGTGGRTCATCCCAATGACATAAACAACAAGAACAATGTTGACAACAA 
GGACAATAACAGCAGAGACAAAGTAATTAAATAGGAAAATCTCCGGCTTTTATGATACCGAT 
TTATCGGATTGTAACTTATTCTTCTTTCTTAAAAAATTGTTTAGGAGCAAACAAATTTTTTA 
TATGTTAGTGTATTCAACTGATTACATTTTTAGTTAAAAAAAAAAATGGATTCTGCTTATAA 
CTAAAAACTGAAAAAAAAGAAAAGTTTCCTTAATTTTTCTTTTTGACTTGAGAAAAAGCTCC 
TCTAGTAAATATGAGTTATATATTAATCAAGTACATAACATAAAAATAGTATATATTAAGTG 
CAAATAGATTGAAAACAAATCAAGAAGAAATTAATTAAGACAGAGTGATTAAGCTTAAAACC 
CCATTTGGACTTGTTCTTTCTCAATGAATCCCTCACAAGCAGCAAGCTTCTTCGATTTTGCT 
TTGACACCACCAATCGGTGTTTTCGAATCTTTCAGGTTTGTCTCGATTTCAATCTAGATCGG 
AGTCAAGTAATAAAATTGATTAACCTAAGTATTCCCGTTCTCTCGTAAGAGTTGGGATTTAG 
CAGTAGATCGGAAATCGGAATTTACGTTTTTGTTAAAAGATTGATGGTTTAGGTAATGGAAC 
ATAGTTCTGGATTCATTGCTTCTAGTTGATTCTCGAATTGTTTGATTTCGCAATGCACATTT 
TTGTTTCAAAGGATCACAGAATTTGATTTAAAATTTGACAAAATTCCATCAATTTCTCATAT 
TAGGGTTTATATTTCTTCTAGTAACTCGAACTTGTTGGAACTCTGTATACTCTGTGCTATGT 
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AGATAAAGTCTTAACATTTTGGTCAACTTTGTTTGATCTCTAAACTAGTTTGGGCTCTCTGT 

TTTAAAGTTTTGTGCTTTCACTATTACACAGGTCTCATACAAGACTACAGTCTCAAGAAGCA 

TAATATCGTCGACTCTGTTTTGAGTTTCTCAACAGTGGTTGAAGCTTAAGGAGGTTCTTATG 
TGCGTTTTGATATC 



SEQ ID NO: 4 

CAAGCTTCTTCAATTTTGCTTGCTCTCTCTCTTACACGGCCAATCGGTGTTTTCGCAGCTTT 

CAGGCCTCAATACAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGC 

ATGGAGTTGAACAATATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACT 

AACTTCATAATCAGCTCTGGACGTTGTTGATTGTATTTGAACAAGAATGTGTAGGCAGAATT 

GTCGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGT 

AAACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACCCATCGTTTCTGCC 

AAGATGCTTGAACTACAAAATTGGGGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGATGG 

TAGTTTTCAACTATAAGGATTGTAATAATACATTACAAAGAACTGAAGTTAGGGAGGATTGT 

TCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGGGCTGCAATTTCATTTGAATTC 

ATCTCATGATTTATTTGAATTTGAGTTCAAGCTTTTGGAAGAATACCAGACAGTTAATGTTT 

CTGTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAGTGATGATGATAAATTTGAGCCC 

TTCTCTCTCTGCTCGAAACCTCGTAAGCGTAGACAAAGAGGTGGCAGAAATAACACCAGGAG 

ACTTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTAGCTAATGGCACAGAAAATGGAA 

TTGCCCTGCTGAATGATGGAAACCGTGGTTTAGGATATCCCGAGGCAACAGAGCTTGCTGGA 

CAATTTGAGATGACTAGCAACATTCCACCAGCCATAGCCCACTCTTCTCTGGACGCTGGTGC 

TAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTACTAAGACAAGAAAGTTATCTGCTG 

AGCGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACGCCAATTCTATCATTCTCACAGA 

GTCCAGCCAATGGCGCTTGAGCAAGTAATGTCTGATCGGGATAGCGAGGATGAAGTCGATGA 

CGATGTTGCAGATTTTGAAGATCGCCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATG 

AAAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGGTTATAGCAGATGGT 

CATATCTCTTGGGCATGTGAAGTATTTTCAAGATTTTACGAGAAAGAGTTGCACTGTTACTC 




ATCACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCATGGACTTGTCGACT 
CAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATACCTCAGTCACTAAC 
AACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACACCAACAACAATAACATTGTGGA 
TCATCCGAATGACATAAAAAACAAGAACAATGTTGACAACAAGGACAATAACAGCAGAGACA 
5 AGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGATCTATCGGATTGTAACTTATTC 
TTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTTTATTTGTTAGTGTATTCAACTG 
ATTACATTTTTAGTTAAAAAAATGGATTCTCCTTAATAACT 



10 SEQ ID NO: 5 

MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKRKS 
RSTGMWFNYKDCNNTLQRTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKLLEEY 
QTVNVSVKLNSFIFEEEGSDDDKFEPFSLCSKPRKRRQRGGRJSnSTTRRLKVCFLPLDSPSLA^ 
15 GTENGIALLNDGNRGLGYPEATELAGQFEMTSNIPPAIAHSSLDAGAKVILTTEAWPATKT 
RKLSAERSEARSHLLLQKRQFYHSHRVQPMALEQVMSDRDSEDEVDDDVADFEDRQMLDDFV 
DVNKDEKQFMHLWNSFVRKQRVIADGHISWACEVFSRFYEKELHCYSSLFWCWRLFLIKLWN 
HGLVDSATINNCNTILENCRNTSVTNNNNNSVDHPSDSNTNNNN 
NNSRDK 

20 

SEQ ID NO: 6 

AAAGAGAAGAGCTTTGACTCTCTCATTGGTCAAACCTGACTGTATTTATATGCGTTATTGTG 
2 5 TGGTAAAGTTTCGACCTTTGACTTGACAAGTTGCCGTTAAGAAGAGAGATGCGTAGATCAGC 
GAGTGGTTCTAGAGTTTTGGATCATTTTCCGGCGACTTCAAGGTCTCCGCCTCGATCTCAGA 
GTGTTACAGCAATGGAAGATGATGTGGAGCTGCTTTTGCCTAGGTACGATCCGAATTCTCAA 
GCGGGGAAGAGAGAGAAATCAAGATTCAGATTTTCAGAAAACGTCATCCATTTGATTCCTCT 
CATTCTTCTTCTCTGTGTCGCAATCCTCTGGCTCTCCTCTTACTCAGGTAAGCCGAGAAATT 



r 



89 



GTTTCAATCTCTATGAATCCATAATTGATCTGTGAAACTTAATTAGGGATTTTACAAAGACT 

CATATGGATATGAGGATCGAGATGTCTGTGCAACGTTAGAATCTTGTGTTGAATTATGGTTT 

CAATTTGTTCATATAATACTAAATCGGTGATGGATTTGGAATTTGTCAGCAGCGTTAAGGAG 

TTGAGTTCCAAAAGCAACATGTTGTCTTGTCTCCATGGGAACTCATATTCAGTTTTGGGAAA 

GGAAACAATTCTTTTACCGCCGGTGATTTTGTGCCGCAAACCATTCGTATTTGTAATTTTTG 

GTTCTGTAGACACACAAAAGGATCTCTCGTTTTCATGAAATGTATGTTTAATATTTCAGTGA 

TATACATCACACAACTCAAGTAGAAAACACTGATGGTTATCCATTAATCATTCTATTGGTCG 

AAAAAAAGATTAGTTTCAACTTAATGCCACCTTAGGATTATATGTTCCTGTGAGTTTCAGCT 

AGCCAACTCAACTAGAGTTAAACAATGGAATCAAAATACATATTCAGTAATTTATTTTAAAC 

TCTGACTATTTATGTAAAACACAAATGGAAATCAAAATTGAAGGTCATGAAGATTCTATTCT 

TAGTGTGAAAAGTATAGATCAATGATTCTTAATTTCTTCATCCTCCACGCATAGATCAATGG 

TGAATATGGTTTTAAATCCTCTAATACTCACTGTACTGCCATGGTAGAGTTAAAAAAACAAT 

TTTAGAAATAT-TAGTGGATTAAGGCATTAAGCTGTCCAAGTTGCTTGTATTTTCTTTTCATT 

TTATTAATTAAAAAAAAAGTTCAACTATTTATTGACTAATAATAATACGTGTTAAATGGTTA 

TCGGTTTAAAATATGGGCCATAGGCCCAGACTTGAAGAAAAACTTGAAACCCAAAGTTTTAT 

TTTTACTTGTTTTCTTTCTCAGTGAATATCTCCCAATCAAGCTTCTTCAATTTTGCTTGCTC 

TCTCTCTTACACGGCCAATCGGTGTTTTCGCAGCTTTCAGGTTTGTCTCAATCTCAAATTAA 

ATCGGAGTCAAGTAATAACAATTGATAACCCTAATTGTTTCAATTATATTGTAJVGATTTGAA 

ATTTTGCAGTAGATCCGGAATCGTATTCTAGTTCTGGAATCGTTGATCTCGATGGAATTTTT 

TTTAAGATTTCTTCATACACATTTGGTTCAAAAGATCACATAATTTTATTTTAATTTGATAA 

GTATGATGATTCTGCTAAGTGGCATTGGATAAAGTTTTCATTTTTGCAATACGTCTAAACTT 

GTCTATGTCTTGAATGAACTCTCTGAGTTGCTTAAAAAGTCTTGTGCTTTCTTTATTACACA 

GGCCTCAATACAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGCAT 

GGAGTTGAACAATATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACTAA 

CTTCATAATCAGCTCTGGACGTTGTTGATTGTATTTGAACAAGAATGTGTAGGCAGAATTGT 

CGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGTAA 

ACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACGTATGATTTGGCCTTC 

CTCTCTCATCATTTTAGCTTAGTAATCTTTCATCTCCTGTGTAGATCACCCACTAATAGTTT 

GAGTTTGCTAAGCTGATTATGGTCTGACTCATGGCGAGTGTGTGCTTCTTTTGTCTCCTAAT 



GTTATTTGAACTTGTTGTTTGTTGTTGCAGCCATCGTTTCTGCC^^^^^ 

AATTGGGGCAAAGCGCAAAAGAAAGTATGCGTTTCTTCTTGAATGTAGTTGCCACAGTGATA 
TGTTATTTATCTTACTTCTAATATGGAAGCTGATGAACTATTTATCTTTGTTGAGTAGATAT 
GGACATAATGAATGGTTTCTTCTTTGTTCATGCTATACACTTATATTTTACAAAATTGTGTT 

ACAAAGAACTGAAGGTTAGTCTTTTTCTGTTCTTCGACAAAATTCGATGTCAATGTCTATGT 
TTCTCTAGATGATTTGTTATTTACTATTTTTTTCTGTATT^^^^^ 

TTCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTC^^^^ 

GTTTCTTCCGGGGCCTTTCATATCTAACACTGTGAAATGCTACTGCCGTTTAATGCTATATA 
) CTTTCACTGTTTGGTTACATATTTTTGTGTTTGTTGTTTGTCTTCTTGCTCTTTTTAAACTG 
CTGAGTGTGTGCTTATCTGAGAAAACATGTTCCAGTTCGAGCTTACAATCCATTGTCTTGTG 
TCTATGCAGGGGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAA 
GGTATGTGGTTTTATGGAATTTCTTGTTTTGCCTATGCCGTTAGTAATGAGGTTATAG^^ 

AAAAGGGTCTTTCCTATTGTAGCTTTTGGAAGAATACCAGACAGTTAATGTTTCTGTAAAAC 

TTAATTCCTTCATATTTGAGGTCAGTTACTTTAAACTTGGTTAATTGGGAAATCCTATAGC 

GGTGAAAATTTGGTTTATATTCCATCCTTATTTGTACTAGGAAGAAGGAAGTGATGATGATA 

AATTTGAGCCCTTCTCTCTCTGGTAACTCTCAGAACCCCTTGATTAAATACCTTAATAGCAG 

TAACTCCTTGCTTTTCTTGTCAGTACTTCTCTATAAATCCAACCACAATGTTTTGCAGCTCG 

AAACCTCGTAAGCGTAGACAAAGAGGTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTT 

TTTACCGTTGGATTCACCCAGTTTAGCTAATGGCACAGAAAATGGAATTGCCCTGCTGAATG 

ATGGTAAAATCACATCTTCTTCTGTGGTATTCGTTGTGGCTTAGAACTTCATTTTACAGAAG 

aagatacaatgtcctgattgtttagtt™^^^^ 

ATGTTACCAGAACTGATGTACAAAATTAATGGCATGCTACAGGAAACCGTGGTTTAGGATAT 

CCCGAGGCAACAGAGCTTGCTGGACAATTTGAGATGACTAGCAACATTCCACCAGC^^^^^^ 

CCACTCTTCTCTGGACGCTGGTGCTAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTA 

CTAAGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGGTTTGTTCATCATGACACC 

CCGTCATCATAATTACCATACCTGTTGTTACAAATGTTCTTCCTATTATGGATAAGTGTTTA 
CTGTACTGCCATATTAACCGAGAAAATTTCTTCCAGCCACCTAC™^^ 

TCTATCATTCTCACAGAGTCCAGGTGATCCAAGTTCCTTCACCTACTTCTTAGGCATTTTCT 



TTAAATTGCTCATGATGATATCTTATCAAAGCATACTTGGTTTGTTCTCATCTAAATTTGTA 

TTTTGATTCTGTATGTATCAACGCAAAAAAATTATGTCCATGTTGTCTCCGTTTTATTGCCA 

CTAACCAAAAACTGCATGTTTCTTGTGACAAGCCAATGGCGCTTGAGCAAGTAATGTCTGAT 

CGGGATAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGGTATTCCA 

TGATTTCTTTCTGCGTTCATTAAGTAGGCAACAGAAAATGGTATACGATGTAACTTGCTAAT 

GGCTTTTGAAACTTAAAAAAGCTGCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATGA 

AAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGTAACTACTTCTCTTAC 

ACTTGAACACACACAAAAAGACCTTATGTCTTACATTCCATACCTGTCTAAATGATTCTGCT 

TATGGAACTTTGAGCTCAAATTATGATTGATGTTTGCAGGGTTATAGCAGATGGTCATATCT 

CTTGGGCATGTGAAGTATTTTCAAGATTTTACGAGAAAGAGTTGCACTGTTACTCATCACTC 

TTCTGGTAATATAAGTACACCAAACATATACAGACACATAACTACACTATCAATTTTGTTTC 

GTTTTTCTGAAAGAAAAATAAAAAATTCCAGGTGTTGGAGATTGTTTTTGATTAAACTATGG 

AACCATGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCG 

TAATACCTCAGTCACTAACAACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACACCA 

ACAACAATAACATTGTGGATCATCCGAATGACATAAAAAACAAGAACAATGTTGACAACAAG 

GACAATAACAGCAGAGACAAGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGATCT 

ATCGGATTGTAACTTATTCTTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTTTAT 

TTGTTAGTGTATTCAACTGATTACATTTTTAGTTAAAAAAATGGATTCTCCTTAATAACTAA 

AGACTGAAAAATAAGATAAGTTTCCTTAATTTTTCTTTTTGACTTGAGAAAAAGCTCCTCTA 

GACCTCTAGTAAATAGGAGTTATATATTAATCAAGTACATAACATAAAAATATATATATTAA 

GTGCAAATAGATTGAAAACAAATCAAGAAATTAATTAAGACACAGTGATTAAGCTTAAAACC 

CCATTTTGACTTGTTCTTTCTCAATGAATCCCTCACAAGCAGCAAGCTTCTTCGATTTTGCT 

TTGACACCACCAATCAGTGTTTTCGAATCTTTCAGGTTTGTCTCGATTTCAAACTAGATCGG 

AGTCAAGTGATAAAATTGACTAACATAATTATTCCCGTTCTCTCGTAAGAGTTGGGATTTAG 

CAGTAGATCGGAAATCGGAATTTACGTTTTTGTTAAAAGATTGATGGTTTAGGTAATAAAAC 

ATAGTTCTGGATTCATTGCTTCTAGTTGATTCTCGAATTGTTTGATTTCGCAATGCACATTT 

TTGGTTCAAAGGATCACATAATTTGCTTTAAAATTTGACAAAACATACCATCAAATTTCTCA 

TATTTCTTCAAGTAACTCGAACTTGTTGGAAATCTATATACTCTGGGCTATGTAGATAAAGT 

CTTAACATTTTGGTCAACATTGTTTGTTCTCTAAACTAGTTTGGGTTCTCTGTTTTAAAGTT 



92 

TGGTGCTTTCACTATTACACAGGTCTTATACAA^^^ 

CGACTCTGTTTTGAGTTTCCCAACAGTGGTTGAAGCTTAAGGAGGTTCTT^^^^ 
AAATC 

5 

SEQ ID NO: 7 

CAAGCTTCTTCAATTTTGCTTGCTCTCTCTCTTACACGGCCAATCGGTGTTTTCGCAGCTTT 
CAGGCCTCAATACAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGC 
0 ATGGAGTTGAACAATATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACT 
AACTTCATAATCAGCTCTGGACGTTGTTGATTGTATTTGAACAAGAATGTGTAGGCAGAATT 
GTCGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGT 
AAACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACCCATCGTTTCTGCC 
AAGATGCTTGAACTACAAAATTGGGGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGATGG 
TAGTTTTCAACTATAAGGATTGTAATAATACATTACAAAGAACTGAAGTTAGGGAGGATTGT 
TCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGTGGGCAACTATTACAACTGAGG 
GGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAAGCTTTTGGAA 
GAATACCAGACAGTTAATGTTTCTGTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAG 
TGATGATGATAAATTTGAGCCCTTCTCTCTCTGCTCGAAACCTCGTAAGCGTAGACAAAGAG 
GTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTA 
GCTAATGGCACAGAAAATGGAATTGCCCTGCTGAATGATGGAAACCGTGGTTTAGGATATCC 
CGAGGCAACAGAGCTTGCTGGACAATTTGAGATGACTAGCAACATTCCACCAGCCATAGCCC 
ACTCTTCTCTGGACGCTGGTGCTAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTACT 
AAGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACG 
CCAATTCTATCATTCTCACAGAGTCCAGCCAATGGCGCTTGAGCAAGTAATGTCTGATCGGG 
ATAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGATGCTTGATGAC 
TTTGTGGATGTGAATAAAGATGAAAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAA 
ACAAAGGGTTATAGCAGATGGTCATATCTCTTGGGCATGTGAAGTATTTTCAAGATTTTACG 
AGAAAGAGTTGCACTGTTACTCATCACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTA 



TGGAACCATGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTG 

CCGTAATACCTCAGTCACTAACAACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACA 

CCAACAACAATAACATTGTGGATCATCCGAATGACATAAAAAACAAGAACAATGTTGACAAC 

AAGGACAATAACAGCAGAGACAAGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGA 

TCTATCGGATTGTAACTTATTCTTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTT 

TATTTGTTAGTGTATTCAACTGATTACATTTTTAGTTAAAAAAATGGATTCTCCTTAATAAC 
T 



SEQ ID NO: 8 

MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKRKS 
RSTGMWFNYKDCNNTLQRTEVREDCSCPFCSMLCGSFKVGNYYN 

SEQ ID NO: 9 

ACATTTTCGTACCGCTCAAGATTTAAGAAGCGTAAAAGGGTGGAAATCTCAAGTGATAAAAT 
TAGGCATGTACATCCACATATTGTGGATTCAGGATCACCTGAAGATGCCCAGGCAGGATCTG 
AAGACGATTACGTGCAGAGGGAAAATGGTAGTTCTGTAGCACACGCTTCTGTTGATCCTGCT 
AATTCATTACACGGTAGCAATCTTTCAGCACCAACAGTGTTACAGTTTGGGAAGACAAGAAA 
GCTGTCTGTTGAACGAGCTGATCCCAGAAATCGGCAGCTCCTACAAAAACGCCAGTTCTTTC 
ATTCTCACAGGGCTCAACCAATGGCATTGGGAGCAGTTTTCTCAGATCGTGATAGTGAAGAT 
GAGGTTGATGATGACATTGCTGATTTTGAAGATAGACAGATGCTTGATGATTTTGTTGATGT 
TACCAAAGACGAACTTATTATGCATATGG 



SEQ ID NO: 10 



94 

TFSYRSRFKKRKRVEISSDKIRHVHPHIVDSGSPEDAQAGSEDDYVQRENGSSVAHASVDPA 
NSLHGSNLSAPTVLQFGKTRKLSVERADPRNRQLLQKRQFFHSHRAQP^4ALGAVFSDRDSED 
E VDDD I ADFEDRQMLDDFVDVTKDEL I MHM 

SEQ ID NO: 11 

ACATGCATATCCTGATGCTGAATGTGCTCAATTGGTACCTGGGAATAATCTTGCACCTCCTG 

CCATGCTACAATTTGCAAAGACAAGAAAATTATCAATTGAACGGTCTGACATGAGAAACCGT 

ACACTCCTTCACAAACGACAATTTTTTCACTCACATAGAGCTCAGCCAATGGCAGCTGAGCA 

AGTTATGTCAGATCGGGATAGTGAGGATGAAGTTGACGATGATGTTGCAGATTTTGAAGACC 

GAAGGATGCTTGATGATTTTGTAGACGTGACTAAAGATGAGAAGCAAATGATGCACTTGTGG 
AACTCATTTGTGAGG 

SEQ ID NO: 12 

HAYPDAECAQLVPGNNLAPPAMLQFAKTRKLSIERSDMRNRTLLHKRQFFHSHRAQPMAAEQ 
VMSDRDSEDEVDDDVADFEDRRMLDDFVDVTKDEKQMMHLWNSFVR 

SEQ ID NO: 13 ' ^ 

ATGGATCCGATTAAGCTGACAACAGAAGCTAAGGTCCCTGCTAAGCGATCAAAGGCTACAAG 
CCACTACTTGCCTCTTCATAAACGCCAGTTCTATCATTCCCGAACCGGTCAGCCATTGTCAC 
TTGAGCAAGTTATGTCTGACCGAGATAGCGAAAATGACGTCGACAAAAATGATGATGCTGCA 
CATCTCGAAGAAAGCCAGATGCTTAATGGTTCCATGGATGAGAATGAAATCGTAGCAGAGAG 
ATTCATAAAACTTTGGAACTCCTTTGTTAAACAGCAAAGGATTGTTGCAGATGCTCATATTC 
CTTGGGCATGTGAAGCATTCTCAAGATTACACCTGCAAGAGCTGCGCAGTAACTTATCACTC 



GACTTGTGCTGGAGACAATTCATGATCAAACAATGGGATTATGGACTTCTTGACAGAGTCAC 
CATGAACAAATGCAATACCATCATCTACCATAATATCTCAACTACCAACGATGACATAAACA 
ATAACAACACAAGGACGACTGATAATATGGATGTTGTCGACGATGACATAAACAGAGACAAG 

SEQ ID NO: 14 

MDPIKLTTEAKVPAKRSKATSHYLPLHKRQFYHSRTGQPLSLEQVMSDRDSENDVDKNDDAA 
HLEESQMLNGSMDENEIVAERFIKLWNSFVKQQRIVADAHIPWACEAFSRLHLQELRSNLSL 
DLCWRQFM I KQWD YGLLDRVTMNKCNT 1 1 YHNI S TTNDD I NNNNTRTTDNMDWDDD INRDK 

SEQ ID NO: 15 

CTCTGAGGAGACACTTTTTTTTTCCTCCCTCCTTCCCTCCTCTCCTCCTCCCTTCCCTTCCC 
CTCTCCTCCCCTCTCTCCTCCTTCCCCCCTCGGTCCGCCGGAGCCTGCTGGGGCGAGCGGTT 
GGTATTGCAGGCGCTTGCTCTCCGGGGCCGCCCGGCGGGTAGCTGGCGGGGGGAGGAGGCAG 
GAACCGCGATGGCGCCTCAGAAGCACGGCGGTGGGGGAGGGGGCGGCTCGGGGCCCAGCGCG 
GGGTCCGGGGGAGGCGGCTTCGGGGGTTCGGCGGCGGTGGCGGCGGCGACGGCTTCGGGCGG 
CAAATCCGGCGGCGGGAGCTGTGGAGGGGGTGGCAGTTACTCGGCCTCCTCCTCCTCCTCCG 
CGGCGGCAGCGGCGGGGGCTGCGGTGTTACCGGTGAAGAAGCCGAAAATGGAGCACGTCCAG 
GCTGACCACGAGCTTTTCCTCCAGGCCTTTGAGAAGCCAACACAGATCTATAGATTTCTTCG 
AACTCGGAATCTCATAGCACCAATATTTTTGCACAGAACTCTTACTTACATGTCTCATCGAA 
ACTCCAGAACAAACATCAAAAGGAAAACATTTAAAGTTGATGATATGTTATCAAAAGTAGAG 
AAAATGAAAGGAGAGCAAGAATCTCATAGCTTGTCAGCTCATTTGCAGCTTACGTTTACTGG 
TTTCTTCCACAAAAATGATAAGCCATCACCAAACTCAGAAAATGAACAAAATTCTGTTACCC 
TGGAAGTCCTGCTTGTGAAAGTTTGCCACAAAAAAAGAAAGGATGTAAGTTGTCCAATAAGG 
CAAGTTCCCACAGGTAAAAAGCAGGTGCCTTTGATTCCTGACCTCAATCAAACAAAACCCGG 
AAATTTCCCGTCCCTTGCAGTTTCCAGTAATGAATTTGAACCTAGTAACAGCCATATGGTGA 



w 



96 

AGTCTTACTCGTTGCTATTTAGAGTGACTCGTCCAGGAAGAAGAGAGTTTAATGGAATGATT 
AATGGAGAAACCAATGAAAATATTGATGTCAATGAAGAGCTTCCAGCCAGAAGAAAACGAAA 
TCGTGAGGATGGGGAAAAGACATTTGTTGCACAAATGACAGTATTTGATAAAAACAGGCGCT 
TACAGCTTTTAGATGGGGTU^^TATGAAGTAGCCATGCAGGAAATGGAAGAATGTCCAATAAGC 
5 AAGAAAAGAGCAACATGGGAGACTATTCTTGATGGGAAGAGGCTGCCTCCATTCGAAACATT 
TTCTCAGGGACCTACGTTGCAGTTCACTCTTCGTTGGACAGGAGAGACCAATGATAAATCTA 
CGGCTCCTATTGCCAAACCTCTTGCCACTAGAAATTCAGAGAGTCTCCATCAGGAAAACAAG 
CCTGGTTCAGTTAAACCTACTCAAACTATTGCTGTTAAAGAATCATTGACTACAGATCTACA 
AACAAGAAAAGAAAAGGATACTCCAAATGAAAACCGACAAAAATTAAGAATATTTTATCAGT 

10 TTCTCTATAACAACAATACAAGGCAACAAACTGAAGCAAGAGATGACCTGCATTGCCCTTGG 
TGTACTCTGAACTGCCGCAAACTTTATAGTTTACTCAAGCATCTTAAACTCTGCCATAGCAG 
ATTTATCTTCAACTATGTTTATCATCCAAAAGGTGCTAGGATAGATGTTTCTATCAATGAGT 
GTTATGATGGCTCCTATGCAGGAAATCCTCAGGATATTCATCGCCAACCTGGATTTGCTTTT 
AGTCGCAACGGACCAGTTAAGAGAACACCTATCACACATATTCTTGTGTGCAGGCCAAAACG 

15 AACAAAAGCAAGCATGTCTGAATTTCTTGAATCTGAAGATGGGGAAGTAGAACAGCAAAGAA 
CATATAGTAGTGGCCACAATCGTCTGTATTTCCATAGTGATACCTGCTTACCTCTCCGTCCA 
CAAGAAATGGAAGTAGATAGTGAAGATGAAAAGGATCCTGAATGGCTAAGAGAAAAAACCAT 
TACACAAATTGAAGAGTTTTCTGATGTTAATGAAGGAGAGAAAGAAGTGATGAAACTCTGGA 
ATCTCCATGTCATGAAGCATGGGTTTATTGCTGACAATCAAATGAATCATGCCTGTATGCTG 

2 0 TTTGTAGAAAATTATGGACAGAAAATAATTAAGAAGAATTTATGTCGAAACTTCATGCTTCA 
TCTAGTCAGCATGCATGACTTTAATCTTATTAGCATAATGTCAATAGATAAAGCTGTTACCA 
AGCTCCGTGAAATGCAGCAAAAATTAGAAAAGGGGGAATCTGCTTCCCCTGCAAACGAAGAA 
ATAACTGAAGAACAAAATGGGACAGCAAATGGATTTAGTGAAATTAACTCAAAAGAGAAAGC 
TTTGGAAACAGATAGTGTCTCAGGGGTTTCAAAACAGAGCAAAAAACAAAAACTCTGAAAAG 

25 CTCTAACCCCATGTTATGGACAAACACTGAAATTACATTTTAGGGAATTCATCCTCTAAGAA 
TTATGTTTTTGTTTTTAATCATATGTTCCAAACAGGCACTGTTAGATGAAGTAAATGATTTC 
AACAAGGATATTTGTATCAGGGTTCTACTTCACTTCATTATGCAGCATTACATGTATATCAC 
TTTTATTGATGTCATTAAAACATTCTGTACTTTAAGCATGAAAAGCAATATTTCAAAGTATT 
TTTAAACTCAACAAATGTCATCAAATATGTTGAATTGATCTAGAAATTATTTCATATATAAA 



TCAGAATTTTTTTGCATTTATGAACGGCTGTTTTTCTACTTTGTAATTGTGAGACATTTTCT 

TGGGGAGGGAAAATTGGAATGGTTCCCTTTTTTAGAAATTGAAGTGGTCTTCATATGTCAAC 

TACAGAAAAGGAAAAAAATAGAAATTGAAGGATTTTTATGAAATTATATTGCATTACTATTT 

GCAGTCAAACTTTGATCCTTGTTTTTGAAATCATTTGTCAATTCGGAATGAAAAATTATAAT 

GTAATTTTACATTACATAAGTTCCTTTTACAATTAAAAAATAGCACTTCTTCATCTTATGCC 

TGTTTGAGAAGATATTAAATTTTCACATTGTTGACAGTGAAATGCTATGTTGGTTTATAAGA 

TTACAGACCATTTGTTTTCATGTGGATAATTTTAGTGCATTGCTCACCCGGTATGTTTTTTT 

TTTTTAACTTGAACATTTTGCTTGTTTTGTTTTTCTTTTTTAATTAGATAATCACACGGAAA 

ATTAAGCTGTTCATATCTTTAAATTAGGATTGCAAACCAAGGAAAGAACGCATTTGAGATTT 

TAAGATGTCACTTATAAGGGGAGAAGTGTTCTTAAAAAGTCAACCAGAAAACTGTTATGCCT 

TTTATTTGTTTGCAAGGATGTCTTTGTAATGTGTTTCATGAATAGAATATCCAATAGAGATA 

AGCTGACTTGAATCATTTTGAGCAATTTTGCCCTGTGTTATATGTGTTTCACGCACATATTT 

GCAGTTGGATTTTCTCCAACAGAAAGTGGATTCACTACTGGCACATTAACAAGCACCAATAG 

GTTTTTATTCCAACTCCGAGCACTGTGGTTGAGTAACATCACCTCAATTTTTTATTATCCTT 

AAAGATATTGCATTTTCATATTCTTTATTTATAAAGGATCAATGCTGCTGTAAATACAGGTA 

TTTTTAATTTTAAAATTTCATTCCACCACCATCAGATGCAGTTCCCTATTTTGTTTAATGAA 

GGGATATATAAGCTTTCTAATGGTGTCTTCAGAAATTTATAAAATGTAAATACTGATTTGAC 

TGGTCTTTAAGATGTGTTTAACTGTGAGGCTATTTAACGAATAGTGTGGATGTGATTTGTCA 

TCCAGTATTAAGTTCTTAGTCATTGATTTTTGTGTTTAAAAAAAAATAGGAAAGAGGGAAAC 

TGCAGCTTTCATTACAGATTCCTTGATTGGTAAGCTCTCCAAATGATGAGTTCTAGTAAACT 

CTGATTTTTGCCTCTGGATAGTAGATCTCGAGCGTTTATCTCGGGCTTTAATTTGCTAAAGC 

TGTGCACATATGTAAAAAAAAAAAAAAAAAGATTATTTTAGGGGAGATGTAGGTGTAGAATT 

ATTGCTTATGTCATTTCTTAAGCAGTTATGCTCTTAATGCTTAAAAGAAGGCTAGCATTGTT 

TGCACAAAAAGTTGGTGATTCCCACCCCAAATAGTAATAAAATTACTTCTGTTGAGTAAACT 

TTTTATGTCATCGTAAAAGCTGGAAAAATCCCTTTGTTTCTATTTATAAAAAAAGTGCTTTT 

CTATATGTACCCTTGATAACAGATTTTGAAGAAATCCTGTAAGATGATAAAGCATTTGAATG 

GTACAGTAGATGTAAAAAAAATTCAGTTTAAAAGAACATTTGTTTTTACATTAAATGTTTAT 

TTGAAATCAAATGATTTTGTACATAAAGTTCAATAATAT 



SEQ ID NO: 16 



98 



LRRHFPFPPSFPPLLLPSLPLSSPLSSFPPRSAGACWGER.VL8A«L..RP;.<.S»RGEEAG 
TA«PQK«<S3=GGGGSGPSAGSCG«3PGGSMVMATASGGKSa.OSCGG«=SYSASSSSSA 
AA«GMVLPVKKPKMEHVO»HELPLQAFEKPTQIYRFI,RTRNLmPIP,HRTL™SHR« 
SRT»IKRKrF™DHLSKVEKMKGE0ESHSLSAHLOLTPTGPFHKM>KPSPNSENECJ»Sm 
EVLI.VKVCHKKRKDVSCPIRQVPTGKKOVPI,IPD™aXKPGNFPSLAVSSHEPEPSNSHMVK 
SVSLLFRVTRPGRREPNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVPDKNRRL 

qlldgeyevam8Emeecpiskkratwetii,dgkrlppfetfsqgptlqptlrwtge™dkst 

APIAKPLATRNSESLHQENKPGSVKPTOTIAVKESLTTDLQTRKEKDTPNENRQKLRIFVQP 
LYNNNTRQOTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINEC 
YDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLESEDGEVEOBRT 
YSSGHNRLYFHSDTCLPLRPQEHEVDSEDEKDPEWLREKTITQIEEFSDVNEGEKEVMKLWN 
LHVMKHGFIADNQmCMLFVEMYGQKIIKKNLCRNFMLHWSMHDFHLISIMSIDKAVTK 
LREMQQKLEKGESASPAHEEITEEQHGTANGFSEINSKEKALETDSVSGVSKQSKKQKL 



Figure 1 
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